Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Shunting yard algorithm

From Wikipedia, the free encyclopedia
Algorithm to parse a syntax with infix notation to postfix notation
This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(August 2013) (Learn how and when to remove this message)
Shunting yard algorithm
ClassParsing
Data structureStack
Worst-caseperformanceO(n){\displaystyle O(n)}
Worst-casespace complexityO(n){\displaystyle O(n)}

Incomputer science, theshunting yard algorithm is a method for parsing arithmetical or logical expressions, or a combination of both, specified ininfix notation. It can produce either a postfix notation string, also known asreverse Polish notation (RPN), or anabstract syntax tree (AST).[1] Thealgorithm was invented byEdsger Dijkstra, first published in November 1961,[2] and named because its operation resembles that of arailroad shunting yard.

Like the evaluation of RPN, the shunting yard algorithm isstack-based. Infix expressions are the form of mathematical notation most people are used to, for instance"3 + 4" or"3 + 4 × (2 − 1)". For the conversion there are two textvariables (strings), the input and the output. There is also astack that holds operators not yet added to the output queue. To convert, the program reads each symbol in order and does something based on that symbol. The result for the above examples would be (inreverse Polish notation)"3 4 +" and"3 4 2 1 − × +", respectively.

The shunting yard algorithm will correctly parse all valid infix expressions, but does not reject all invalid expressions. For example,"1 2 +" is not a valid infix expression, but would be parsed as"1 + 2". The algorithm can however reject expressions with mismatched parentheses.

The shunting yard algorithm was later generalized intooperator-precedence parsing.

A simple conversion

[edit]
  1. Input:3 + 4
  2. Push 3 to the outputqueue (whenever a number is read it is pushed to the output)
  3. Push + (or its ID) onto the operatorstack
  4. Push 4 to the output queue
  5. After reading the expression,pop the operators off the stack and add them to the output.
    In this case there is only one, "+".
  6. Output:3 4 +

This already shows a couple of rules:

  • All numbers are pushed to the output when they are read.
  • At the end of reading the expression, pop all operators off the stack and onto the output.

Graphical illustration

[edit]

Graphical illustration of algorithm, using athree-way railroad junction. The input is processed one symbol at a time: if a variable or number is found, it is copied directly to the output a), c), e), h). If the symbol is an operator, it is pushed onto the operator stack b), d), f). If the operator's precedence is lower than that of the operators at the top of the stack or the precedences are equal and the operator is left associative, then that operator is popped off the stack and added to the output g). Finally, any remaining operators are popped off the stack and added to the output i).

The algorithm in detail

[edit]
For important terms, seetoken (parser),function (mathematics),Operator associativity, andOrder of operations.
/* The functions referred to in this algorithm are simple single argument functions such as sine, inverse or factorial. *//* This implementation does not implement composite functions, functions with a variable number of arguments, or unary operators. */while there aretokens to be read:    read a tokenif the token is:    - anumber:        put it into the output queue    - afunction:        push it onto the operator stack     - anoperatoro1:while (            there is an operatoro2 at the top of the operator stack which is not a left parenthesis,and (o2 has greaterprecedence thano1or (o1 ando2 have the same precedenceando1 is left-associative))        ):            popo2 from the operator stack into the output queue        pusho1 onto the operator stack    - a",":while the operator at the top of the operator stack is not a left parenthesis:             pop the operator from the operator stack into the output queue    - aleft parenthesis (i.e. "("):        push it onto the operator stack    - aright parenthesis (i.e. ")"):while the operator at the top of the operator stack is not a left parenthesis:            {assert the operator stack is not empty}/* If the stack runs out without finding a left parenthesis, then there are mismatched parentheses. */            pop the operator from the operator stack into the output queue        {assert there is a left parenthesis at the top of the operator stack}        pop the left parenthesis from the operator stack and discard itif there is a function token at the top of the operator stack,then:            pop the function from the operator stack into the output queue/* After the while loop, pop the remaining items from the operator stack into the output queue. */while there are tokens on the operator stack:/* If the operator token on the top of the stack is a parenthesis, then there are mismatched parentheses. */    {assert the operator on top of the stack is not a (left) parenthesis}    pop the operator from the operator stack onto the output queue

To analyze the running time complexity of this algorithm, one has only to note that each token will be read once, each number, function, or operator will be printed once, and each function, operator, or parenthesis will be pushed onto the stack and popped off the stack once—therefore, there are at most a constant number of operations executed per token, and the running time is thus O(n) — linear in the size of the input.

The shunting yard algorithm can also be applied to produce prefix notation (also known asPolish notation). To do this one would simply start from the end of a string of tokens to be parsed and work backwards, reverse the output queue (therefore making the output queue an output stack), and flip the left and right parenthesis behavior (remembering that the now-left parenthesis behavior should pop until it finds a now-right parenthesis), while making sure to change theassociativity condition to right.

Detailed examples

[edit]

Input:3 + 4 × 2 ÷ ( 1 − 5 ) ^ 2 ^ 3

OperatorPrecedenceAssociativity
^4Right
×3Left
÷3Left
+2Left
2Left

The symbol ^ represents thepower operator.

TokenActionOutput
(inRPN)
Operator
stack
Notes
3Add token to output3
+Push token to stack3+
4Add token to output3 4+
×Push token to stack3 4× +× has higher precedence than +
2Add token to output3 4 2× +
÷Pop stack to output3 4 2 ×+÷ and × have same precedence
Push token to stack3 4 2 ×÷ +÷ has higher precedence than +
(Push token to stack3 4 2 ×( ÷ +
1Add token to output3 4 2 × 1( ÷ +
Push token to stack3 4 2 × 1− ( ÷ +
5Add token to output3 4 2 × 1 5− ( ÷ +
)Pop stack to output3 4 2 × 1 5 −( ÷ +Repeated until "(" found
Pop stack3 4 2 × 1 5 −÷ +Discard matching parenthesis
^Push token to stack3 4 2 × 1 5 −^ ÷ +^ has higher precedence than ÷
2Add token to output3 4 2 × 1 5 − 2^ ÷ +
^Push token to stack3 4 2 × 1 5 − 2^ ^ ÷ +^ is evaluated right-to-left
3Add token to output3 4 2 × 1 5 − 2 3^ ^ ÷ +
endPop entire stack to output3 4 2 × 1 5 − 2 3 ^ ^ ÷ +

Input:sin ( max ( 2, 3 ) ÷ 3 ×π )

TokenActionOutput
(inRPN)
Operator
stack
Notes
sinPush token to stacksin
(Push token to stack( sin
maxPush token to stackmax ( sin
(Push token to stack( max ( sin
2Add token to output2( max ( sin
,Ignore2( max ( sinThe operator at the top of the stack is a left parenthesis
3Add token to output2 3( max ( sin
)Pop stack to output2 3( max ( sinRepeated until "(" is at the top of the stack
Pop stack2 3max ( sinDiscarding matching parentheses
Pop stack to output2 3 max( sinFunction at top of the stack
÷Push token to stack2 3 max÷ ( sin
3Add token to output2 3 max 3÷ ( sin
×Pop stack to output2 3 max 3 ÷( sin
Push token to stack2 3 max 3 ÷× ( sin
πAdd token to output2 3 max 3 ÷π× ( sin
)Pop stack to output2 3 max 3 ÷π ×( sinRepeated until "(" is at the top of the stack
Pop stack2 3 max 3 ÷π ×sinDiscarding matching parentheses
Pop stack to output2 3 max 3 ÷π × sinFunction at top of the stack
endPop entire stack to output2 3 max 3 ÷π × sin

See also

[edit]

References

[edit]
  1. ^Theodore Norvell (1999)."Parsing Expressions by Recursive Descent".www.engr.mun.ca. Retrieved2020-12-28.
  2. ^Dijkstra, Edsger (1961-11-01)."Algol 60 translation : An Algol 60 translator for the X1 and making a translator for Algol 60".Stichting Mathematisch Centrum.

External links

[edit]
Top-down
Bottom-up
Mixed, other
Related topics
Retrieved from "https://en.wikipedia.org/w/index.php?title=Shunting_yard_algorithm&oldid=1323863175"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp