
InputElementRegExpOrTemplateTail

InputElementHashbangOrRegExp

TemplateSubstitutionTail

InputElementTemplateTail

TemplateSubstitutionTail

12.1 Unicode Format-Control Characters

The Unicode format-control characters (i.e., the characters in category “Cf” in the Unicode Character Database such as LEFT-TO-RIGHT MARK or RIGHT-TO-LEFT MARK) are control codes used to control the formatting of a range of text in the absence of higher-level protocols for this (such as mark-up languages).

It is useful to allow format-control characters in source text to facilitate editing and display. All format control characters may be used within comments, and within string literals, template literals, and regular expression literals.

U+FEFF (ZERO WIDTH NO-BREAK SPACE) is a format-control character used primarily at the start of a text to mark it as Unicode and to allow detection of the text's encoding and byte order. <ZWNBSP> characters intended for this purpose can sometimes also appear after the start of a text, for example as a result of concatenating files. In ECMAScript source text, <ZWNBSP> code points are treated as white space characters (see 12.2) outside of comments, string literals, template literals, and regular expression literals.

12.2 White Space

White space code points are used to improve source text readability and to separate tokens (indivisible lexical units) from each other, but are otherwise insignificant. White space code points may occur between any two tokens and at the start or end of input. White space code points may occur within a StringLiteral, a RegularExpressionLiteral, a Template, or a TemplateSubstitutionTail, where they are considered significant code points forming part of a literal value. They may also occur within a Comment, but cannot appear within any other kind of token.

The ECMAScript white space code points are listed in Table 33.

Table 33: White Space Code Points

Code Points	Name	Abbreviation
`U+0009`	CHARACTER TABULATION	<TAB>
`U+000B`	LINE TABULATION	<VT>
`U+000C`	FORM FEED (FF)	<FF>
`U+FEFF`	ZERO WIDTH NO-BREAK SPACE	<ZWNBSP>
any code point in general category “Space_Separator”		<USP>

Note 1

U+0020 (SPACE) and U+00A0 (NO-BREAK SPACE) code points are part of <USP>.

Note 2

Other than for the code points listed in Table 33, ECMAScript WhiteSpace intentionally excludes all code points that have the Unicode “White_Space” property but which are not classified in general category “Space_Separator” (“Zs”).
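The rules in Table 33 and Note 2 can be checked informally in a running engine. The sketch below assumes a conforming implementation; `run` is a hypothetical helper introduced only for this illustration, and it uses the `Function` constructor so that the unusual code points are supplied through string escapes rather than typed into the source file.

```javascript
// Hypothetical helper: evaluate a source string as a function body.
function run(source) {
  return new Function(source)();
}

// U+FEFF (<ZWNBSP>) and U+00A0 (part of <USP>) separate tokens like a space does.
const withZwnbsp = run("return 1\uFEFF+\u00A02;");

// U+0085 (NEXT LINE) has the Unicode White_Space property but is in general
// category Cc, not Zs, and is not listed in Table 33, so it is rejected.
let nelRejected = false;
try {
  run("return 1\u0085+ 2;");
} catch (e) {
  nelRejected = e instanceof SyntaxError;
}

console.log(withZwnbsp, nelRejected);
```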

Syntax

WhiteSpace ::
    <TAB>
    <VT>
    <FF>
    <ZWNBSP>
    <USP>

12.3 Line Terminators

Like white space code points, line terminator code points are used to improve source text readability and to separate tokens (indivisible lexical units) from each other. However, unlike white space code points, line terminators have some influence over the behaviour of the syntactic grammar. In general, line terminators may occur between any two tokens, but there are a few places where they are forbidden by the syntactic grammar. Line terminators also affect the process of automatic semicolon insertion (12.10). A line terminator cannot occur within any token except a StringLiteral, Template, or TemplateSubstitutionTail. <LF> and <CR> line terminators cannot occur within a StringLiteral token except as part of a LineContinuation.

A line terminator can occur within a MultiLineComment but cannot occur within a SingleLineComment.

Line terminators are included in the set of white space code points that are matched by the `\s` class in regular expressions.

The ECMAScript line terminator code points are listed in Table 34.

Table 34: Line Terminator Code Points

Code Point	Unicode Name	Abbreviation
`U+000A`	LINE FEED (LF)	<LF>
`U+000D`	CARRIAGE RETURN (CR)	<CR>
`U+2028`	LINE SEPARATOR	<LS>
`U+2029`	PARAGRAPH SEPARATOR	<PS>

Only the Unicode code points in Table 34 are treated as line terminators. Other new line or line breaking Unicode code points are not treated as line terminators but are treated as white space if they meet the requirements listed in Table 33. The sequence <CR><LF> is commonly used as a line terminator. It should be considered a single SourceCharacter for the purpose of reporting line numbers.
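As an informal illustration of Table 34, the following sketch counts lines while treating <CR><LF> as a single terminator, and checks two observable consequences of the rules above. `countLines` is a hypothetical helper, not something the specification defines.

```javascript
// Count lines, treating <CR><LF> as one terminator and recognizing only
// the four code points of Table 34.
function countLines(text) {
  const terminators = text.match(/\r\n|[\n\r\u2028\u2029]/g) || [];
  return terminators.length + 1;
}

// Five terminators (CRLF, LF, LS, PS, CR) separate six lines.
const lines = countLines("a\r\nb\nc\u2028d\u2029e\rf");

// <LS> directly after `return` triggers automatic semicolon insertion,
// so the function returns undefined rather than 1.
const afterLs = new Function("return\u20281;")();

// The \s class matches every line terminator.
const sMatchesAll = ["\n", "\r", "\u2028", "\u2029"].every((c) => /\s/.test(c));

console.log(lines, afterLs, sMatchesAll);
```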

Syntax

LineTerminator ::
    <LF>
    <CR>
    <LS>
    <PS>

LineTerminatorSequence ::
    <LF>
    <CR> [lookahead ≠ <LF>]
    <LS>
    <PS>
    <CR> <LF>

12.4 Comments

Comments can be either single or multi-line. Multi-line comments cannot nest.

Because a single-line comment can contain any Unicode code point except a LineTerminator code point, and because of the general rule that a token is always as long as possible, a single-line comment always consists of all code points from the `//` marker to the end of the line. However, the LineTerminator at the end of the line is not considered to be part of the single-line comment; it is recognized separately by the lexical grammar and becomes part of the stream of input elements for the syntactic grammar. This point is very important, because it implies that the presence or absence of single-line comments does not affect the process of automatic semicolon insertion (see 12.10).

Comments behave like white space and are discarded except that, if a MultiLineComment contains a line terminator code point, then the entire comment is considered to be a LineTerminator for purposes of parsing by the syntactic grammar.
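The interaction between comments and automatic semicolon insertion can be observed with `return`, which does not allow a line terminator before its expression. This is a small sketch assuming a conforming engine; the variable names are illustrative only.

```javascript
// A multi-line comment without a line terminator behaves like white space.
const sameLine = new Function("return /* no terminator */ 1;")();

// A multi-line comment that contains a line terminator counts as a
// LineTerminator, so a semicolon is inserted after `return`.
const multiLine = new Function("return /* line one\n line two */ 1;")();

// The terminator after a single-line comment is not part of the comment;
// the expression simply continues on the next line.
const singleLine = new Function("return 1 // trailing comment\n + 1;")();

console.log(sameLine, multiLine, singleLine);
```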

Syntax

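Comment ::
    MultiLineComment
    SingleLineComment

MultiLineComment ::
    /* MultiLineCommentChars_opt */

MultiLineCommentChars ::
    MultiLineNotAsteriskChar MultiLineCommentChars_opt
    * PostAsteriskCommentChars_opt

PostAsteriskCommentChars ::
    MultiLineNotForwardSlashOrAsteriskChar MultiLineCommentChars_opt
    * PostAsteriskCommentChars_opt

MultiLineNotAsteriskChar ::
    SourceCharacter but not *

MultiLineNotForwardSlashOrAsteriskChar ::
    SourceCharacter but not one of / or *

SingleLineComment ::
    // SingleLineCommentChars_opt

SingleLineCommentChars ::
    SingleLineCommentChar SingleLineCommentChars_opt

SingleLineCommentChar ::
    SourceCharacter but not LineTerminator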

A number of productions in this section are given alternative definitions in section B.1.1.

12.5 Hashbang Comments

Hashbang Comments are location-sensitive and like other types of comments are discarded from the stream of input elements for the syntactic grammar.

Syntax

HashbangComment ::
    #! SingleLineCommentChars_opt

12.6 Tokens

Syntax

Note

The DivPunctuator, RegularExpressionLiteral, RightBracePunctuator, and TemplateSubstitutionTail productions derive additional tokens that are not included in the CommonToken production.

12.7 Names and Keywords

IdentifierName and ReservedWord are tokens that are interpreted according to the Default Identifier Syntax given in Unicode Standard Annex #31, Identifier and Pattern Syntax, with some small modifications. ReservedWord is an enumerated subset of IdentifierName. The syntactic grammar defines Identifier as an IdentifierName that is not a ReservedWord. The Unicode identifier grammar is based on character properties specified by the Unicode Standard. The Unicode code points in the specified categories in the latest version of the Unicode Standard must be treated as in those categories by all conforming ECMAScript implementations. ECMAScript implementations may recognize identifier code points defined in later editions of the Unicode Standard.

Note 1

This standard specifies specific code point additions: U+0024 (DOLLAR SIGN) and U+005F (LOW LINE) are permitted anywhere in an IdentifierName.

Syntax

IdentifierPartChar

one of

any Unicode code point with the Unicode property “ID_Start”

UnicodeIDContinue

any Unicode code point with the Unicode property “ID_Continue”

The definition of the nonterminal UnicodeEscapeSequence is given in 12.9.4.

Note 2

The nonterminal IdentifierPart derives `_` via UnicodeIDContinue.

Note 3

The sets of code points with Unicode properties “ID_Start” and “ID_Continue” include, respectively, the code points with Unicode properties “Other_ID_Start” and “Other_ID_Continue”.

12.7.1 Identifier Names

Unicode escape sequences are permitted in an IdentifierName, where they contribute a single Unicode code point equal to the IdentifierCodePoint of the UnicodeEscapeSequence. The `\` preceding the UnicodeEscapeSequence does not contribute any code points. A UnicodeEscapeSequence cannot be used to contribute a code point to an IdentifierName that would otherwise be invalid. In other words, if a `\` UnicodeEscapeSequence sequence were replaced by the SourceCharacter it contributes, the result must still be a valid IdentifierName that has the exact same sequence of SourceCharacter elements as the original IdentifierName. All interpretations of IdentifierName within this specification are based upon their actual code points regardless of whether or not an escape sequence was used to contribute any particular code point.

Two IdentifierNames that are canonically equivalent according to the Unicode Standard are not equal unless, after replacement of each UnicodeEscapeSequence, they are represented by the exact same sequence of code points.
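A short sketch of these rules, assuming a conforming engine. The identifiers `abc` and `obj` are chosen only for illustration.

```javascript
// \u0061 contributes the code point for "a", so this declares `abc`.
const \u0061bc = 1;
const sameBinding = abc === 1;

// U+00E9 and "e" followed by U+0301 are canonically equivalent, but they
// are different code point sequences and therefore different names.
const obj = { caf\u00E9: "precomposed", cafe\u0301: "decomposed" };
const keyCount = Object.keys(obj).length;

// An escape cannot contribute a code point that would be invalid if it
// were written literally: U+0020 (SPACE) is not an IdentifierPart.
let spaceRejected = false;
try {
  new Function("var a\\u0020b;");
} catch (e) {
  spaceRejected = e instanceof SyntaxError;
}

console.log(sameBinding, keyCount, spaceRejected);
```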

12.7.1.1 Static Semantics: Early Errors

IdentifierStart :: \ UnicodeEscapeSequence

It is a Syntax Error if the IdentifierCodePoint of UnicodeEscapeSequence is not some Unicode code point matched by the IdentifierStartChar lexical grammar production.

IdentifierPart :: \ UnicodeEscapeSequence

It is a Syntax Error if the IdentifierCodePoint of UnicodeEscapeSequence is not some Unicode code point matched by the IdentifierPartChar lexical grammar production.

12.7.1.2 Static Semantics: IdentifierCodePoints

The syntax-directed operation IdentifierCodePoints takes no arguments and returns a List of code points. It is defined piecewise over the following productions:

IdentifierName :: IdentifierStart

Let cp be the IdentifierCodePoint of IdentifierStart.
Return « cp ».

IdentifierName :: IdentifierName IdentifierPart

Let cps be the IdentifierCodePoints of the derived IdentifierName.
Let cp be the IdentifierCodePoint of IdentifierPart.
Return the list-concatenation of cps and « cp ».

12.7.1.3 Static Semantics: IdentifierCodePoint

The syntax-directed operation IdentifierCodePoint takes no arguments and returns a code point. It is defined piecewise over the following productions:

IdentifierStart :: IdentifierStartChar

Return the code point matched by IdentifierStartChar.

IdentifierPart :: IdentifierPartChar

Return the code point matched by IdentifierPartChar.

UnicodeEscapeSequence :: u Hex4Digits

Return the code point whose numeric value is the MV of Hex4Digits.

UnicodeEscapeSequence :: u{ CodePoint }

Return the code point whose numeric value is the MV of CodePoint.

12.7.2 Keywords and Reserved Words

A keyword is a token that matches IdentifierName, but also has a syntactic use; that is, it appears literally, in a fixed width font, in some syntactic production. The keywords of ECMAScript include `if`, `while`, `async`, `await`, and many others.

A reserved word is an IdentifierName that cannot be used as an identifier. Many keywords are reserved words, but some are not, and some are reserved only in certain contexts. `if` and `while` are reserved words. `await` is reserved only inside async functions and modules. `async` is not reserved; it can be used as a variable name or statement label without restriction.

This specification uses a combination of grammatical productions and early error rules to specify which names are valid identifiers and which are reserved words. All tokens in the ReservedWord list below, except for `await` and `yield`, are unconditionally reserved. Exceptions for `await` and `yield` are specified in 13.1, using parameterized syntactic productions. Lastly, several early error rules restrict the set of valid identifiers. See 13.1.1, 14.3.1.1, 14.7.5.1, and 15.7.1. In summary, there are five categories of identifier names:

Those that are always allowed as identifiers, and are not keywords, such as `Math`, `window`, `toString`, and `_`;
Those that are never allowed as identifiers, namely the ReservedWords listed below except `await` and `yield`;
Those that are contextually allowed as identifiers, namely `await` and `yield`;
Those that are contextually disallowed as identifiers, in strict mode code: `let`, `static`, `implements`, `interface`, `package`, `private`, `protected`, and `public`;
Those that are always allowed as identifiers, but also appear as keywords within certain syntactic productions, at places where Identifier is not allowed: `as`, `async`, `from`, `get`, `meta`, `of`, `set`, and `target`.

The term conditional keyword, or contextual keyword, is sometimes used to refer to the keywords that fall in the last three categories, and thus can be used as identifiers in some contexts and as keywords in others.
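The five categories can be probed by asking an engine whether a snippet parses. The sketch below assumes a conforming implementation; `parses` is a hypothetical helper that relies on the `Function` constructor parsing its argument as a non-strict, non-async function body.

```javascript
// Hypothetical helper: does `source` parse as a function body?
function parses(source) {
  try {
    new Function(source);
    return true;
  } catch (e) {
    if (e instanceof SyntaxError) return false;
    throw e;
  }
}

const results = {
  alwaysAllowed: parses("var toString = 1;"),
  neverAllowed: parses("var if = 1;"),
  awaitOutsideAsync: parses("var await = 1;"),
  awaitInsideAsync: parses("async function f() { var await = 1; }"),
  letNonStrict: parses("var let = 1;"),
  letStrict: parses("'use strict'; var let = 1;"),
  asyncAsName: parses("var async = 1; return async;"),
};

console.log(results);
```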

Syntax

ReservedWord

one of

await

break

case

catch

class

const

continue

debugger

default

delete

else

enum

export

extends

false

finally

for

function

import

instanceof

new

null

return

super

switch

this

throw

true

try

typeof

var

void

while

with

yield

Note 1

Per 5.1.5, keywords in the grammar match literal sequences of specific SourceCharacter elements. A code point in a keyword cannot be expressed by a `\` UnicodeEscapeSequence.

An IdentifierName can contain `\` UnicodeEscapeSequences, but it is not possible to declare a variable named "else" by spelling it `els\u{65}`. The early error rules in 13.1.1 rule out identifiers with the same StringValue as a reserved word.
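A brief sketch of Note 1, assuming a conforming engine. The escaped spelling is rejected where an Identifier is required, but it is still an IdentifierName, so it can appear where the grammar accepts any IdentifierName, such as a property name.

```javascript
// Declaring a variable whose StringValue is "else" is an early error.
let escapedKeywordRejected = false;
try {
  new Function("var els\\u{65} = 1;");
} catch (e) {
  escapedKeywordRejected = e instanceof SyntaxError;
}

// As a property name (an IdentifierName, not an Identifier) it is accepted.
const holder = { els\u{65}: 1 };
const propertyValue = holder["else"];

console.log(escapedKeywordRejected, propertyValue);
```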

Note 2

`enum` is not currently used as a keyword in this specification. It is a future reserved word, set aside for use as a keyword in future language extensions.

Similarly, `implements`, `interface`, `package`, `private`, `protected`, and `public` are future reserved words in strict mode code.

Note 3

The names `arguments` and `eval` are not keywords, but they are subject to some restrictions in strict mode code. See 13.1.1, 8.6.4, 15.2.1, 15.5.1, 15.6.1, and 15.8.1.

12.8 Punctuators

Syntax

Punctuator

OtherPunctuator

OptionalChainingPunctuator

[lookahead ∉DecimalDigit]

OtherPunctuator

one of

{

(

)

[

]

...

;

===

!==

>>>

**=

<<=

>>=

>>>=

&&=

||=

??=

DivPunctuator

RightBracePunctuator

}

12.9 Literals

12.9.1 Null Literals

Syntax

NullLiteral

null

12.9.2 Boolean Literals

Syntax

BooleanLiteral

true

false

12.9.3 Numeric Literals

Syntax

DecimalLiteral

[+Sep]

[+Sep]

[+Sep]

opt

[+Sep]

[Sep]

[?Sep]

[?Sep]

[?Sep]

[+Sep]

opt

ExponentPart

[+Sep]

opt

[+Sep]

ExponentPart

[+Sep]

opt

ExponentPart

[+Sep]

opt

opt

[+Sep]

[Sep]

[?Sep]

[+Sep]

[+Sep]

one of

one of

[Sep]

[?Sep]

one of

[Sep]

[?Sep]

[?Sep]

[?Sep]

[Sep]

[?Sep]

[?Sep]

[Sep]

[?Sep]

[+Sep]

[+Sep]

one of

[Sep]

[?Sep]

[?Sep]

[Sep]

[?Sep]

[+Sep]

[+Sep]

LegacyOctalLikeDecimalIntegerLiteral

NonOctalDigit

NonOctalDigit

LegacyOctalLikeDecimalIntegerLiteral

DecimalDigit

LegacyOctalLikeDecimalIntegerLiteral

one of

one of

[Sep]

[?Sep]

[?Sep]

[Sep]

[?Sep]

[+Sep]

[+Sep]

one of

The SourceCharacter immediately following a NumericLiteral must not be an IdentifierStart or DecimalDigit.

Note

For example: `3in` is an error and not the two input elements `3` and `in`.
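The example in the Note can be checked directly. This is a sketch assuming a conforming engine; the array contents are arbitrary.

```javascript
// With white space, `3` and `in` are two input elements: index 3 exists.
const spaced = new Function("return 3 in [10, 20, 30, 40];")();

// Without white space, `3in` is a lexical error.
let fusedRejected = false;
try {
  new Function("return 3in [10, 20, 30, 40];");
} catch (e) {
  fusedRejected = e instanceof SyntaxError;
}

console.log(spaced, fusedRejected);
```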

12.9.3.1 Static Semantics: Early Errors

It is a Syntax Error if IsStrict(this production) is true.

Note

In non-strict code, this syntax is Legacy.
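A sketch of the legacy forms this early error covers, assuming a conforming engine that supports the legacy syntax in non-strict code. `parsesStrict` is a hypothetical helper.

```javascript
// Non-strict code: a leading 0 followed by octal digits is read as octal,
// while a leading 0 followed by a digit sequence containing 8 or 9 is decimal.
const legacyOctal = new Function("return 017;")();
const octalLikeDecimal = new Function("return 089;")();

// Hypothetical helper: does the expression parse in strict mode code?
function parsesStrict(expression) {
  try {
    new Function("'use strict'; return " + expression + ";");
    return true;
  } catch (e) {
    if (e instanceof SyntaxError) return false;
    throw e;
  }
}

const strictLegacy = parsesStrict("017");
const strictOctalLike = parsesStrict("089");
const strictModern = parsesStrict("0o17");

console.log(legacyOctal, octalLikeDecimal, strictLegacy, strictOctalLike, strictModern);
```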

12.9.3.2 Static Semantics: MV

A numeric literal stands for a value of the Number type or the BigInt type.

The MV of DecimalLiteral :: DecimalIntegerLiteral . DecimalDigits is the MV of DecimalIntegerLiteral plus (the MV of DecimalDigits × 10^(-n)), where n is the number of code points in DecimalDigits, excluding all occurrences of NumericLiteralSeparator.
The MV of DecimalLiteral :: DecimalIntegerLiteral . ExponentPart is the MV of DecimalIntegerLiteral × 10^e, where e is the MV of ExponentPart.
The MV of DecimalLiteral :: DecimalIntegerLiteral . DecimalDigits ExponentPart is (the MV of DecimalIntegerLiteral plus (the MV of DecimalDigits × 10^(-n))) × 10^e, where n is the number of code points in DecimalDigits, excluding all occurrences of NumericLiteralSeparator, and e is the MV of ExponentPart.
The MV of DecimalLiteral :: . DecimalDigits is the MV of DecimalDigits × 10^(-n), where n is the number of code points in DecimalDigits, excluding all occurrences of NumericLiteralSeparator.
The MV of DecimalLiteral :: . DecimalDigits ExponentPart is the MV of DecimalDigits × 10^(e - n), where n is the number of code points in DecimalDigits, excluding all occurrences of NumericLiteralSeparator, and e is the MV of ExponentPart.
The MV of DecimalLiteral :: DecimalIntegerLiteral ExponentPart is the MV of DecimalIntegerLiteral × 10^e, where e is the MV of ExponentPart.
The MV of DecimalIntegerLiteral :: 0 is 0.
The MV of DecimalIntegerLiteral :: NonZeroDigit NumericLiteralSeparator_opt DecimalDigits is (the MV of NonZeroDigit × 10^n) plus the MV of DecimalDigits, where n is the number of code points in DecimalDigits, excluding all occurrences of NumericLiteralSeparator.
The MV of DecimalDigits :: DecimalDigits DecimalDigit is (the MV of DecimalDigits × 10) plus the MV of DecimalDigit.
The MV of DecimalDigits :: DecimalDigits NumericLiteralSeparator DecimalDigit is (the MV of DecimalDigits × 10) plus the MV of DecimalDigit.
The MV of ExponentPart :: ExponentIndicator SignedInteger is the MV of SignedInteger.
The MV of SignedInteger :: - DecimalDigits is the negative of the MV of DecimalDigits.
The MV of DecimalDigit :: 0 or of HexDigit :: 0 or of OctalDigit :: 0 or of LegacyOctalEscapeSequence :: 0 or of BinaryDigit :: 0 is 0.
The MV of DecimalDigit :: 1 or of NonZeroDigit :: 1 or of HexDigit :: 1 or of OctalDigit :: 1 or of BinaryDigit :: 1 is 1.
The MV of DecimalDigit :: 2 or of NonZeroDigit :: 2 or of HexDigit :: 2 or of OctalDigit :: 2 is 2.
The MV of DecimalDigit :: 3 or of NonZeroDigit :: 3 or of HexDigit :: 3 or of OctalDigit :: 3 is 3.
The MV of DecimalDigit :: 4 or of NonZeroDigit :: 4 or of HexDigit :: 4 or of OctalDigit :: 4 is 4.
The MV of DecimalDigit :: 5 or of NonZeroDigit :: 5 or of HexDigit :: 5 or of OctalDigit :: 5 is 5.
The MV of DecimalDigit :: 6 or of NonZeroDigit :: 6 or of HexDigit :: 6 or of OctalDigit :: 6 is 6.
The MV of DecimalDigit :: 7 or of NonZeroDigit :: 7 or of HexDigit :: 7 or of OctalDigit :: 7 is 7.
The MV of DecimalDigit :: 8 or of NonZeroDigit :: 8 or of NonOctalDigit :: 8 or of HexDigit :: 8 is 8.
The MV of DecimalDigit :: 9 or of NonZeroDigit :: 9 or of NonOctalDigit :: 9 or of HexDigit :: 9 is 9.
The MV of HexDigit :: a or of HexDigit :: A is 10.
The MV of HexDigit :: b or of HexDigit :: B is 11.
The MV of HexDigit :: c or of HexDigit :: C is 12.
The MV of HexDigit :: d or of HexDigit :: D is 13.
The MV of HexDigit :: e or of HexDigit :: E is 14.
The MV of HexDigit :: f or of HexDigit :: F is 15.
The MV of BinaryDigits :: BinaryDigits BinaryDigit is (the MV of BinaryDigits × 2) plus the MV of BinaryDigit.
The MV of BinaryDigits :: BinaryDigits NumericLiteralSeparator BinaryDigit is (the MV of BinaryDigits × 2) plus the MV of BinaryDigit.
The MV of OctalDigits :: OctalDigits OctalDigit is (the MV of OctalDigits × 8) plus the MV of OctalDigit.
The MV of OctalDigits :: OctalDigits NumericLiteralSeparator OctalDigit is (the MV of OctalDigits × 8) plus the MV of OctalDigit.
The MV of LegacyOctalIntegerLiteral :: LegacyOctalIntegerLiteral OctalDigit is (the MV of LegacyOctalIntegerLiteral times 8) plus the MV of OctalDigit.
The MV of NonOctalDecimalIntegerLiteral :: LegacyOctalLikeDecimalIntegerLiteral NonOctalDigit is (the MV of LegacyOctalLikeDecimalIntegerLiteral times 10) plus the MV of NonOctalDigit.
The MV of NonOctalDecimalIntegerLiteral :: NonOctalDecimalIntegerLiteral DecimalDigit is (the MV of NonOctalDecimalIntegerLiteral times 10) plus the MV of DecimalDigit.
The MV of LegacyOctalLikeDecimalIntegerLiteral :: LegacyOctalLikeDecimalIntegerLiteral OctalDigit is (the MV of LegacyOctalLikeDecimalIntegerLiteral times 10) plus the MV of OctalDigit.
The MV of HexDigits :: HexDigits HexDigit is (the MV of HexDigits × 16) plus the MV of HexDigit.
The MV of HexDigits :: HexDigits NumericLiteralSeparator HexDigit is (the MV of HexDigits × 16) plus the MV of HexDigit.
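The left-recursive digit rules above amount to a fold: multiply the running value by the radix and add the next digit, with NumericLiteralSeparator contributing nothing. The sketch below is one way to express that; `mvOfDigits` is a hypothetical helper, it uses BigInt so the result stays an exact mathematical integer, and it does not check that separators are placed where the grammar allows them.

```javascript
// Fold a digit string into its mathematical value, skipping separators.
function mvOfDigits(digits, radix) {
  let mv = 0n;
  for (const ch of digits) {
    if (ch === "_") continue; // NumericLiteralSeparator is excluded
    const digit = parseInt(ch, radix);
    if (Number.isNaN(digit)) {
      throw new SyntaxError(`not a base-${radix} digit: ${ch}`);
    }
    mv = mv * BigInt(radix) + BigInt(digit);
  }
  return mv;
}

const hex = mvOfDigits("ff_ff", 16);
const binary = mvOfDigits("1010", 2);
const decimal = mvOfDigits("1_000", 10);

console.log(hex, binary, decimal);
```

The results agree with the corresponding literals `0xff_ff`, `0b1010`, and `1_000` when those are written directly in source text.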

12.9.3.3 Static Semantics: NumericValue

The syntax-directed operation NumericValue takes no arguments and returns a Number or a BigInt. It is defined piecewise over the following productions:

DecimalLiteral

Return RoundMVResult(MV of DecimalLiteral).

Return 𝔽(MV of NonDecimalIntegerLiteral).

Return 𝔽(MV of LegacyOctalIntegerLiteral).

Return the BigInt value for the MV of NonDecimalIntegerLiteral.

Return 0_ℤ.

Return the BigInt value for the MV of NonZeroDigit.

Let n be the number of code points in DecimalDigits, excluding all occurrences of NumericLiteralSeparator.
Let mv be (the MV of NonZeroDigit × 10^n) plus the MV of DecimalDigits.
Return ℤ(mv).
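The difference between the Number and BigInt results is visible when the MV is not exactly representable as a Number. A small sketch, assuming an engine with BigInt support:

```javascript
// The MV 2^53 + 1 is rounded when the literal denotes a Number,
// but kept exactly when the literal carries the BigInt suffix.
const asNumber = 9007199254740993;
const asBigInt = 9007199254740993n;

const numberIsRounded = asNumber === 9007199254740992;
const bigIntIsExact = asBigInt === 2n ** 53n + 1n;

// Non-decimal literals follow the same split.
const kinds = [typeof 0x1f, typeof 0x1fn];

console.log(numberIsRounded, bigIntIsExact, kinds);
```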

12.9.4 String Literals

Note 1

A string literal is 0 or more Unicode code points enclosed in single or double quotes. Unicode code points may also be represented by an escape sequence. All code points may appear literally in a string literal except for the closing quote code points, U+005C (REVERSE SOLIDUS), U+000D (CARRIAGE RETURN), and U+000A (LINE FEED). Any code points may appear in the form of an escape sequence. String literals evaluate to ECMAScript String values. When generating these String values, Unicode code points are UTF-16 encoded as defined in 11.1.1. Code points belonging to the Basic Multilingual Plane are encoded as a single code unit element of the string. All other code points are encoded as two code unit elements of the string.
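The encoding rule and the list of code points that cannot appear literally can be illustrated as follows. This is a sketch assuming a conforming engine; the variable names are illustrative.

```javascript
// A Basic Multilingual Plane code point becomes one code unit.
const bmp = "\u00E9";

// Any other code point becomes two code units (a surrogate pair).
const astral = "\u{1F600}";
const units = [astral.charCodeAt(0), astral.charCodeAt(1)];

// U+2028 may appear literally inside a string literal...
const literalLs = new Function('return "a\u2028b";')();

// ...but a literal LINE FEED may not.
let rawLfRejected = false;
try {
  new Function('return "a\nb";');
} catch (e) {
  rawLfRejected = e instanceof SyntaxError;
}

console.log(bmp.length, astral.length, units, literalLs.length, rawLfRejected);
```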

Syntax

StringLiteral

DoubleStringCharacters

opt

SingleStringCharacters

opt

DoubleStringCharacters

DoubleStringCharacter

DoubleStringCharacters

opt

SingleStringCharacters

SingleStringCharacter

SingleStringCharacters

opt

DoubleStringCharacter

but not one of" or\ orLineTerminator

<LS>

<PS>

LineContinuation

SingleStringCharacter

but not one of' or\ orLineTerminator

<LS>

<PS>

LineContinuation

LineTerminatorSequence

LegacyOctalEscapeSequence

CharacterEscapeSequence

[lookahead ∉DecimalDigit]

NonOctalDecimalEscapeSequence

HexEscapeSequence

CharacterEscapeSequence

SingleEscapeCharacter

NonEscapeCharacter

SingleEscapeCharacter

one of

NonEscapeCharacter

LegacyOctalEscapeSequence

but not one ofEscapeCharacter orLineTerminator

EscapeCharacter

SingleEscapeCharacter

DecimalDigit

[lookahead ∈ {8,9 }]

NonZeroOctalDigit

[lookahead ∉OctalDigit]

ZeroToThree

NonOctalDecimalEscapeSequence

[lookahead ∉OctalDigit]

but not0

one of

one of

one of

HexEscapeSequence

}

The definition of the nonterminal HexDigit is given in 12.9.3. SourceCharacter is defined in 11.1.

Note 2

<LF> and <CR> cannot appear in a string literal, except as part of a LineContinuation to produce the empty code points sequence. The proper way to include either in the String value of a string literal is to use an escape sequence such as `\n` or `\u000A`.

12.9.4.1 Static Semantics: Early Errors

EscapeSequence ::
    LegacyOctalEscapeSequence
    NonOctalDecimalEscapeSequence

It is a Syntax Error if IsStrict(this production) is true.

Note 1

In non-strict code, this syntax is Legacy.

Note 2

It is possible for string literals to precede a Use Strict Directive that places the enclosing code in strict mode, and implementations must take care to enforce the above rules for such literals. For example, the following source text contains a Syntax Error:

function invalid() { "\7"; "use strict"; }
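The same situation can be reproduced with the `Function` constructor, which parses its argument as a function body. This is a sketch assuming a conforming engine that supports the legacy escapes in non-strict code.

```javascript
// Non-strict code: \7 is a legacy octal escape for code unit 0x0007,
// and \8 simply denotes the digit 8.
const bell = new Function('return "\\7";')();
const eight = new Function('return "\\8";')();

// The directive that follows makes the body strict, so the earlier
// literal retroactively becomes an error.
let rejected = false;
try {
  new Function('"\\7"; "use strict";');
} catch (e) {
  rejected = e instanceof SyntaxError;
}

console.log(bell.charCodeAt(0), eight, rejected);
```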

12.9.4.2 Static Semantics: SV

The syntax-directed operation SV takes no arguments and returns a String.

A string literal stands for a value of the String type. SV produces String values for string literals through recursive application on the various parts of the string literal. As part of this process, some Unicode code points within the string literal are interpreted as having a mathematical value, as described below or in 12.9.3.

The SV of StringLiteral :: "" is the empty String.
The SV of StringLiteral :: '' is the empty String.
The SV of DoubleStringCharacters :: DoubleStringCharacter DoubleStringCharacters is the string-concatenation of the SV of DoubleStringCharacter and the SV of DoubleStringCharacters.
The SV of SingleStringCharacters :: SingleStringCharacter SingleStringCharacters is the string-concatenation of the SV of SingleStringCharacter and the SV of SingleStringCharacters.
The SV of DoubleStringCharacter :: SourceCharacter but not one of " or \ or LineTerminator is the result of performing UTF16EncodeCodePoint on the code point matched by SourceCharacter.
The SV of DoubleStringCharacter :: <LS> is the String value consisting of the code unit 0x2028 (LINE SEPARATOR).
The SV of DoubleStringCharacter :: <PS> is the String value consisting of the code unit 0x2029 (PARAGRAPH SEPARATOR).
The SV of DoubleStringCharacter :: LineContinuation is the empty String.
The SV of SingleStringCharacter :: SourceCharacter but not one of ' or \ or LineTerminator is the result of performing UTF16EncodeCodePoint on the code point matched by SourceCharacter.
The SV of SingleStringCharacter :: <LS> is the String value consisting of the code unit 0x2028 (LINE SEPARATOR).
The SV of SingleStringCharacter :: <PS> is the String value consisting of the code unit 0x2029 (PARAGRAPH SEPARATOR).
The SV of SingleStringCharacter :: LineContinuation is the empty String.
The SV of EscapeSequence :: 0 is the String value consisting of the code unit 0x0000 (NULL).
The SV of CharacterEscapeSequence :: SingleEscapeCharacter is the String value consisting of the code unit whose numeric value is determined by the SingleEscapeCharacter according to Table 35.

Table 35: String Single Character Escape Sequences

Escape Sequence	Code Unit Value	Unicode Character Name	Symbol
`\b`	`0x0008`	BACKSPACE	<BS>
`\t`	`0x0009`	CHARACTER TABULATION	<HT>
`\n`	`0x000A`	LINE FEED (LF)	<LF>
`\v`	`0x000B`	LINE TABULATION	<VT>
`\f`	`0x000C`	FORM FEED (FF)	<FF>
`\r`	`0x000D`	CARRIAGE RETURN (CR)	<CR>
`\"`	`0x0022`	QUOTATION MARK	`"`
`\'`	`0x0027`	APOSTROPHE	`'`
`\\`	`0x005C`	REVERSE SOLIDUS	`\`

The SV of NonEscapeCharacter :: SourceCharacter but not one of EscapeCharacter or LineTerminator is the result of performing UTF16EncodeCodePoint on the code point matched by SourceCharacter.
The SV of EscapeSequence :: LegacyOctalEscapeSequence is the String value consisting of the code unit whose numeric value is the MV of LegacyOctalEscapeSequence.
The SV of NonOctalDecimalEscapeSequence :: 8 is the String value consisting of the code unit 0x0038 (DIGIT EIGHT).
The SV of NonOctalDecimalEscapeSequence :: 9 is the String value consisting of the code unit 0x0039 (DIGIT NINE).
The SV of HexEscapeSequence :: x HexDigit HexDigit is the String value consisting of the code unit whose numeric value is the MV of HexEscapeSequence.
The SV of Hex4Digits :: HexDigit HexDigit HexDigit HexDigit is the String value consisting of the code unit whose numeric value is the MV of Hex4Digits.
The SV of UnicodeEscapeSequence :: u{ CodePoint } is the result of performing UTF16EncodeCodePoint on the MV of CodePoint.
The SV of TemplateEscapeSequence :: 0 is the String value consisting of the code unit 0x0000 (NULL).

12.9.4.3 Static Semantics: MV

The MV of LegacyOctalEscapeSequence :: ZeroToThree OctalDigit is (8 times the MV of ZeroToThree) plus the MV of OctalDigit.
The MV of LegacyOctalEscapeSequence :: FourToSeven OctalDigit is (8 times the MV of FourToSeven) plus the MV of OctalDigit.
The MV of LegacyOctalEscapeSequence :: ZeroToThree OctalDigit OctalDigit is (64 (that is, 8²) times the MV of ZeroToThree) plus (8 times the MV of the first OctalDigit) plus the MV of the second OctalDigit.
The MV of ZeroToThree :: 0 is 0.
The MV of ZeroToThree :: 1 is 1.
The MV of ZeroToThree :: 2 is 2.
The MV of ZeroToThree :: 3 is 3.
The MV of FourToSeven :: 4 is 4.
The MV of FourToSeven :: 5 is 5.
The MV of FourToSeven :: 6 is 6.
The MV of FourToSeven :: 7 is 7.
The MV of HexEscapeSequence :: x HexDigit HexDigit is (16 times the MV of the first HexDigit) plus the MV of the second HexDigit.
The MV of Hex4Digits :: HexDigit HexDigit HexDigit HexDigit is (0x1000 × the MV of the first HexDigit) plus (0x100 × the MV of the second HexDigit) plus (0x10 × the MV of the third HexDigit) plus the MV of the fourth HexDigit.
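The positional formulas for Hex4Digits and the three-digit LegacyOctalEscapeSequence can be written out directly. The helpers below (`digitValues`, `mvHex4Digits`, `mvThreeDigitOctal`) are hypothetical names for this sketch; they assume the input already has the right number of digits and do not enforce the ZeroToThree restriction on the first octal digit.

```javascript
// Map each character to its digit value in the given radix.
function digitValues(text, radix) {
  return [...text].map((ch) => {
    const value = parseInt(ch, radix);
    if (Number.isNaN(value)) {
      throw new SyntaxError(`not a base-${radix} digit: ${ch}`);
    }
    return value;
  });
}

// MV of Hex4Digits :: HexDigit HexDigit HexDigit HexDigit
function mvHex4Digits(text) {
  const [a, b, c, d] = digitValues(text, 16);
  return 0x1000 * a + 0x100 * b + 0x10 * c + d;
}

// MV of LegacyOctalEscapeSequence :: ZeroToThree OctalDigit OctalDigit
function mvThreeDigitOctal(text) {
  const [a, b, c] = digitValues(text, 8);
  return 64 * a + 8 * b + c;
}

const euro = mvHex4Digits("20AC");
const letterA = mvThreeDigitOctal("101");

console.log(euro.toString(16), letterA);
```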

12.9.5 Regular Expression Literals

Note 1

A regular expression literal is an input element that is converted to a RegExp object (see 22.2) each time the literal is evaluated. Two regular expression literals in a program evaluate to regular expression objects that never compare as `===` to each other even if the two literals' contents are identical. A RegExp object may also be created at runtime by `new RegExp` or calling the RegExp constructor as a function (see 22.2.4).

The productions below describe the syntax for a regular expression literal and are used by the input element scanner to find the end of the regular expression literal. The source text comprising the RegularExpressionBody and the RegularExpressionFlags are subsequently parsed again using the more stringent ECMAScript Regular Expression grammar (22.2.1).

An implementation may extend the ECMAScript Regular Expression grammar defined in 22.2.1, but it must not extend the RegularExpressionBody and RegularExpressionFlags productions defined below or the productions used by these productions.
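The identity behaviour described in Note 1 can be seen by evaluating the same literal more than once. A minimal sketch; `makePattern` is an illustrative name.

```javascript
// Each evaluation of the literal produces a new RegExp object.
function makePattern() {
  return /ab+c/;
}

const first = makePattern();
const second = makePattern();
const distinctObjects = first !== second;
const samePattern = first.source === second.source;

// RegExp objects can also be created at runtime, with or without `new`.
const viaNew = new RegExp("ab+c");
const viaCall = RegExp("ab+c");

console.log(distinctObjects, samePattern, viaNew.test("abbc"), viaCall.test("ac"));
```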

Syntax

RegularExpressionFirstChar

RegularExpressionChars

[empty]

RegularExpressionChars

RegularExpressionChar

RegularExpressionFirstChar

but not one of* or\ or/ or[

RegularExpressionClass

RegularExpressionChar

but not one of\ or/ or[

RegularExpressionClass

RegularExpressionClassChars

but notLineTerminator

RegularExpressionClass

[

]

RegularExpressionClassChars

[empty]

RegularExpressionClassChars

RegularExpressionClassChar

but not one of] or\

[empty]

IdentifierPartChar

Note 2

Regular expression literals may not be empty; instead of representing an empty regular expression literal, the code unit sequence `//` starts a single-line comment. To specify an empty regular expression, use: `/(?:)/`.
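For illustration, the empty pattern written as suggested in the Note matches at any position, and a RegExp constructed from an empty string reports the same source text:

```javascript
const empty = /(?:)/;

const matchesEmptyString = empty.test("");
const matchesAnything = empty.test("abc");

// The constructor accepts an empty pattern; its source is reported as (?:).
const constructedSource = new RegExp("").source;

console.log(matchesEmptyString, matchesAnything, constructedSource);
```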

12.9.5.1 Static Semantics: BodyText

The syntax-directed operation BodyText takes no arguments and returns source text. It is defined piecewise over the following productions:

Return the source text that was recognized as RegularExpressionBody.

12.9.5.2 Static Semantics: FlagText

The syntax-directed operation FlagText takes no arguments and returns source text. It is defined piecewise over the following productions:

Return the source text that was recognized as RegularExpressionFlags.

12.9.6 Template Literal Lexical Components

Syntax

Template

NoSubstitutionTemplate

TemplateHead

NoSubstitutionTemplate

TemplateCharacters

opt

TemplateHead

TemplateCharacters

opt

TemplateSubstitutionTail

}

opt

}

opt

opt

[lookahead ≠{]

TemplateEscapeSequence

NotEscapeSequence

LineContinuation

LineTerminatorSequence

but not one of` or\ or$ orLineTerminator

TemplateEscapeSequence

CharacterEscapeSequence

[lookahead ∉DecimalDigit]

HexEscapeSequence

NotEscapeSequence

DecimalDigit

but not0

[lookahead ∉HexDigit]

[lookahead ∉HexDigit]

[lookahead ≠{]

[lookahead ∉HexDigit]

[lookahead ∉HexDigit]

[lookahead ∉HexDigit]

{

[lookahead ∉HexDigit]

{

NotCodePoint

[lookahead ∉HexDigit]

{

CodePoint

[lookahead ∉HexDigit]

[lookahead ≠}]

NotCodePoint

HexDigits

[~Sep]

but only if the MV ofHexDigits > 0x10FFFF

CodePoint

HexDigits

[~Sep]

but only if the MV ofHexDigits ≤ 0x10FFFF

Note

TemplateSubstitutionTail is used by the InputElementTemplateTail alternative lexical goal.

12.9.6.1 Static Semantics: TV

The syntax-directed operation TV takes no arguments and returns a String or undefined. A template literal component is interpreted by TV as a value of the String type. TV is used to construct the indexed components of a template object (colloquially, the template values). In TV, escape sequences are replaced by the UTF-16 code unit(s) of the Unicode code point represented by the escape sequence.

The TV of NoSubstitutionTemplate :: `` is the empty String.
The TV of TemplateHead :: `${ is the empty String.
The TV of TemplateMiddle :: }${ is the empty String.
The TV of TemplateTail :: }` is the empty String.
The TV of TemplateCharacters :: TemplateCharacter TemplateCharacters is undefined if the TV of TemplateCharacter is undefined or the TV of TemplateCharacters is undefined. Otherwise, it is the string-concatenation of the TV of TemplateCharacter and the TV of TemplateCharacters.
The TV of TemplateCharacter :: SourceCharacter but not one of ` or \ or $ or LineTerminator is the result of performing UTF16EncodeCodePoint on the code point matched by SourceCharacter.
The TV of TemplateCharacter :: $ is the String value consisting of the code unit 0x0024 (DOLLAR SIGN).
The TV of TemplateCharacter :: \ TemplateEscapeSequence is the SV of TemplateEscapeSequence.
The TV of TemplateCharacter :: \ NotEscapeSequence is undefined.
The TV of TemplateCharacter :: LineTerminatorSequence is the TRV of LineTerminatorSequence.
The TV of LineContinuation :: \ LineTerminatorSequence is the empty String.
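A tag function receives the template values computed by TV as the elements of its first argument, with the raw values (TRV, described next) on its `raw` property. The sketch below assumes an engine that permits invalid escape sequences in tagged templates; `cookedAndRaw` is a hypothetical tag function.

```javascript
// Hypothetical tag function: copy out the cooked and raw string arrays.
function cookedAndRaw(strings) {
  return { cooked: [...strings], raw: [...strings.raw] };
}

// A valid escape is replaced in the cooked value and kept as written in raw.
const valid = cookedAndRaw`a\n${0}b`;

// \u followed by a non-hex digit is a NotEscapeSequence: its TV is undefined.
const invalid = cookedAndRaw`\unicode`;

// An empty template has the empty String as its only value.
const empty = cookedAndRaw``;

console.log(valid, invalid, empty);
```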

12.9.6.2 Static Semantics: TRV

Thesyntax-directed operation TRV takes no arguments and returns a String. A template literal component is interpreted by TRV as a value of theString type. TRV is used to construct the raw components of a template object (colloquially, the template raw values). TRV is similar toTV with the difference being that in TRV, escape sequences are interpreted as they appear in the literal.

The TRV of NoSubstitutionTemplate :: `` is the empty String.
The TRV of TemplateHead :: `${ is the empty String.
The TRV of TemplateMiddle :: }${ is the empty String.
The TRV of TemplateTail :: }` is the empty String.
The TRV of TemplateCharacters :: TemplateCharacter TemplateCharacters is the string-concatenation of the TRV of TemplateCharacter and the TRV of TemplateCharacters.
The TRV of TemplateCharacter :: SourceCharacter but not one of ` or \ or $ or LineTerminator is the result of performing UTF16EncodeCodePoint on the code point matched by SourceCharacter.
The TRV of TemplateCharacter :: $ is the String value consisting of the code unit 0x0024 (DOLLAR SIGN).
The TRV of TemplateCharacter :: \ TemplateEscapeSequence is the string-concatenation of the code unit 0x005C (REVERSE SOLIDUS) and the TRV of TemplateEscapeSequence.
The TRV of TemplateCharacter :: \ NotEscapeSequence is the string-concatenation of the code unit 0x005C (REVERSE SOLIDUS) and the TRV of NotEscapeSequence.
The TRV of TemplateEscapeSequence :: 0 is the String value consisting of the code unit 0x0030 (DIGIT ZERO).
The TRV of NotEscapeSequence :: 0 DecimalDigit is the string-concatenation of the code unit 0x0030 (DIGIT ZERO) and the TRV of DecimalDigit.
The TRV of NotEscapeSequence :: x [lookahead ∉ HexDigit] is the String value consisting of the code unit 0x0078 (LATIN SMALL LETTER X).
The TRV of NotEscapeSequence :: x HexDigit [lookahead ∉ HexDigit] is the string-concatenation of the code unit 0x0078 (LATIN SMALL LETTER X) and the TRV of HexDigit.
The TRV of NotEscapeSequence :: u [lookahead ∉ HexDigit] [lookahead ≠ {] is the String value consisting of the code unit 0x0075 (LATIN SMALL LETTER U).
The TRV of NotEscapeSequence :: u HexDigit [lookahead ∉ HexDigit] is the string-concatenation of the code unit 0x0075 (LATIN SMALL LETTER U) and the TRV of HexDigit.
The TRV of NotEscapeSequence :: u HexDigit HexDigit [lookahead ∉ HexDigit] is the string-concatenation of the code unit 0x0075 (LATIN SMALL LETTER U), the TRV of the first HexDigit, and the TRV of the second HexDigit.
The TRV of NotEscapeSequence :: u HexDigit HexDigit HexDigit [lookahead ∉ HexDigit] is the string-concatenation of the code unit 0x0075 (LATIN SMALL LETTER U), the TRV of the first HexDigit, the TRV of the second HexDigit, and the TRV of the third HexDigit.
The TRV of NotEscapeSequence :: u { [lookahead ∉ HexDigit] is the string-concatenation of the code unit 0x0075 (LATIN SMALL LETTER U) and the code unit 0x007B (LEFT CURLY BRACKET).
The TRV of NotEscapeSequence :: u { NotCodePoint [lookahead ∉ HexDigit] is the string-concatenation of the code unit 0x0075 (LATIN SMALL LETTER U), the code unit 0x007B (LEFT CURLY BRACKET), and the TRV of NotCodePoint.
The TRV of NotEscapeSequence :: u { CodePoint [lookahead ∉ HexDigit] [lookahead ≠ }] is the string-concatenation of the code unit 0x0075 (LATIN SMALL LETTER U), the code unit 0x007B (LEFT CURLY BRACKET), and the TRV of CodePoint.
The TRV of DecimalDigit :: one of 0 1 2 3 4 5 6 7 8 9 is the result of performing UTF16EncodeCodePoint on the single code point matched by this production.
The TRV of CharacterEscapeSequence :: NonEscapeCharacter is the SV of NonEscapeCharacter.
The TRV of SingleEscapeCharacter :: one of ' " \ b f n r t v is the result of performing UTF16EncodeCodePoint on the single code point matched by this production.
The TRV of HexEscapeSequence :: x HexDigit HexDigit is the string-concatenation of the code unit 0x0078 (LATIN SMALL LETTER X), the TRV of the first HexDigit, and the TRV of the second HexDigit.
The TRV of UnicodeEscapeSequence :: u Hex4Digits is the string-concatenation of the code unit 0x0075 (LATIN SMALL LETTER U) and the TRV of Hex4Digits.
The TRV of UnicodeEscapeSequence :: u { CodePoint } is the string-concatenation of the code unit 0x0075 (LATIN SMALL LETTER U), the code unit 0x007B (LEFT CURLY BRACKET), the TRV of CodePoint, and the code unit 0x007D (RIGHT CURLY BRACKET).
The TRV of Hex4Digits :: HexDigit HexDigit HexDigit HexDigit is the string-concatenation of the TRV of the first HexDigit, the TRV of the second HexDigit, the TRV of the third HexDigit, and the TRV of the fourth HexDigit.
The TRV of HexDigits :: HexDigits HexDigit is the string-concatenation of the TRV of HexDigits and the TRV of HexDigit.
The TRV of HexDigit :: one of 0 1 2 3 4 5 6 7 8 9 a b c d e f A B C D E F is the result of performing UTF16EncodeCodePoint on the single code point matched by this production.
The TRV of LineContinuation :: \ LineTerminatorSequence is the string-concatenation of the code unit 0x005C (REVERSE SOLIDUS) and the TRV of LineTerminatorSequence.
The TRV of LineTerminatorSequence :: <LF> is the String value consisting of the code unit 0x000A (LINE FEED).
The TRV of LineTerminatorSequence :: <CR> is the String value consisting of the code unit 0x000A (LINE FEED).
The TRV of LineTerminatorSequence :: <LS> is the String value consisting of the code unit 0x2028 (LINE SEPARATOR).
The TRV of LineTerminatorSequence :: <PS> is the String value consisting of the code unit 0x2029 (PARAGRAPH SEPARATOR).
The TRV of LineTerminatorSequence :: <CR><LF> is the String value consisting of the code unit 0x000A (LINE FEED).
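
The raw values described by the TRV rules above are exposed through the `raw` property of the template object passed to a tag function. The sketch below assumes a hypothetical tag function named `raw` and compares a few raw values against ordinary string literals in which each backslash is written explicitly.

```javascript
// Hypothetical tag function: returns the template raw values (TRV) of every
// component.
function raw(strings) {
  return Array.from(strings.raw);
}

// Valid escape sequences are kept as written, including the leading backslash.
const kept = raw`\n\x41\u{1F600}`;
console.log(kept[0] === "\\n\\x41\\u{1F600}"); // true

// A NotEscapeSequence also keeps its source spelling: backslash, then `x`,
// followed by the ordinary template character `Z`.
const notEscape = raw`\xZ`;
console.log(notEscape[0] === "\\xZ"); // true

// A `$` that does not start a substitution is the single code unit 0x0024.
const dollar = raw`$5`;
console.log(dollar[0] === "$5"); // true
```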

Note

TV excludes the code units of LineContinuation while TRV includes them. <CR><LF> and <CR> LineTerminatorSequences are normalized to <LF> for both TV and TRV. An explicit TemplateEscapeSequence is needed to include a <CR> or <CR><LF> sequence.
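
The behaviour in this note can be checked by building source text that contains real line terminator code points. The following sketch uses the `Function` constructor for that purpose; `evaluateTemplate` is a hypothetical helper, not part of the language.

```javascript
// Hypothetical helper: builds source text for a tagged template whose single
// component is `body`, evaluates it, and returns the TV (cooked) and TRV (raw).
function evaluateTemplate(body) {
  const tag = (strings) => ({ cooked: strings[0], raw: strings.raw[0] });
  return new Function("tag", "return tag`" + body + "`;")(tag);
}

// LineContinuation: a backslash followed by a real <LF> in the source text.
// TV drops it, TRV keeps the backslash and the <LF>.
const continuation = evaluateTemplate("a\\\nb");
console.log(continuation.cooked === "ab", continuation.raw === "a\\\nb"); // true true

// A real <CR><LF> pair, or a lone <CR>, is normalized to <LF> in both values.
const crlf = evaluateTemplate("a\r\nb");
const cr = evaluateTemplate("a\rb");
console.log(crlf.cooked === "a\nb", crlf.raw === "a\nb"); // true true
console.log(cr.cooked === "a\nb", cr.raw === "a\nb"); // true true

// An explicit escape sequence is the way to obtain <CR><LF> in the TV.
const escaped = evaluateTemplate("a\\r\\nb");
console.log(escaped.cooked === "a\r\nb", escaped.raw === "a\\r\\nb"); // true true
```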

12.10 Automatic Semicolon Insertion

Most ECMAScript statements and declarations must be terminated with a semicolon. Such semicolons may always appear explicitly in the source text. For convenience, however, such semicolons may be omitted from the source text in certain situations. These situations are described by saying that semicolons are automatically inserted into the source code token stream in those situations.

12.10.1 Rules of Automatic Semicolon Insertion

In the following rules, “token” means the actual recognized lexical token determined using the current lexical goal symbol as described in clause 12.

There are three basic rules of semicolon insertion:

When, as the source text is parsed from left to right, a token (called the offending token) is encountered that is not allowed by any production of the grammar, then a semicolon is automatically inserted before the offending token if one or more of the following conditions is true:
- The offending token is separated from the previous token by at least one LineTerminator.
- The offending token is }.
- The previous token is ) and the inserted semicolon would then be parsed as the terminating semicolon of a do-while statement (14.7.2).
When, as the source text is parsed from left to right, the end of the input stream of tokens is encountered and the parser is unable to parse the input token stream as a single instance of the goal nonterminal, then a semicolon is automatically inserted at the end of the input stream.
When, as the source text is parsed from left to right, a token is encountered that is allowed by some production of the grammar, but the production is a restricted production and the token would be the first token for a terminal or nonterminal immediately following the annotation “[no LineTerminator here]” within the restricted production (and therefore such a token is called a restricted token), and the restricted token is separated from the previous token by at least one LineTerminator, then a semicolon is automatically inserted before the restricted token.
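
The three rules can be exercised by handing small source texts to the `Function` constructor, which parses its argument as a function body. This is only an illustrative sketch; `parses` is a hypothetical helper and the snippets are chosen for this example.

```javascript
// Hypothetical helper: reports whether `source` parses as a function body.
function parses(source) {
  try {
    new Function(source); // parses the text without running it
    return true;
  } catch (err) {
    if (err instanceof SyntaxError) return false;
    throw err;
  }
}

const results = {
  // Rule 1: the offending token `var` follows a LineTerminator.
  newline: parses("var a = 1\nvar b = 2"),
  // Rule 1 does not apply: the offending token is on the same line.
  sameLine: parses("var a = 1 var b = 2"),
  // Rule 1: the offending token is }.
  closingBrace: parses("{ var a = 1 }"),
  // Rule 1: the previous token is ) and the semicolon ends a do-while statement.
  doWhile: parses("do {} while (false) var c = 3"),
  // Rule 2: the end of the input stream is reached.
  endOfInput: parses("var d = 4"),
};
console.log(results);

// Rule 3: `1` is a restricted token after `return`, so the body is parsed as
// `return; 1 + 1;` and the function returns undefined.
const restricted = new Function("return\n1 + 1");
console.log(restricted()); // undefined

// Rule 3: `++` is a restricted token after `a`, so it becomes a prefix
// operator applied to `b`.
const postfix = new Function("var a = 1, b = 1\na\n++\nb\nreturn [a, b]");
console.log(postfix()); // [ 1, 2 ]
```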

However, there is an additional overriding condition on the preceding rules: a semicolon is never inserted automatically if the semicolon would then be parsed as an empty statement or if that semicolon would become one of the two semicolons in the header of a for statement (see 14.7.4).
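
A short sketch of the overriding condition, again using the `Function` constructor purely as a parser; `parses` is a hypothetical helper and the snippets are illustrative.

```javascript
// Hypothetical helper: reports whether `source` parses as a function body.
function parses(source) {
  try {
    new Function(source); // parses the text without running it
    return true;
  } catch (err) {
    if (err instanceof SyntaxError) return false;
    throw err;
  }
}

// The offending token `else` follows a LineTerminator, but the inserted
// semicolon would be parsed as an empty statement, so none is inserted.
const emptyStatement = parses("if (true)\nelse {}");
console.log(emptyStatement); // false

// Writing the semicolon explicitly is allowed.
const explicitEmpty = parses("if (true);\nelse {}");
console.log(explicitEmpty); // true

// Line terminators never supply the two semicolons of a for statement header.
const forHeader = parses("for (var i = 0\ni < 3\ni++) {}");
console.log(forHeader); // false

const explicitHeader = parses("for (var i = 0; i < 3; i++) {}");
console.log(explicitHeader); // true
```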

Note

The following are the only restricted productions in the grammar:

UpdateExpression[Yield, Await] :
    LeftHandSideExpression[?Yield, ?Await] [no LineTerminator here] ++
    LeftHandSideExpression[?Yield, ?Await] [no LineTerminator here] --

ContinueStatement[Yield, Await] :
    continue ;
    continue [no LineTerminator here] LabelIdentifier[?Yield, ?Await] ;

BreakStatement[Yield, Await] :
    break ;
    break [no LineTerminator here] LabelIdentifier[?Yield, ?Await] ;

ReturnStatement[Yield, Await] :
    return ;
    return [no LineTerminator here] Expression[+In, ?Yield, ?Await] ;

ThrowStatement[Yield, Await] :
    throw [no LineTerminator here] Expression[+In, ?Yield, ?Await] ;

YieldExpression[In, Await] :
    yield [no LineTerminator here] AssignmentExpression[?In, +Yield, ?Await]
    yield [no LineTerminator here] * AssignmentExpression[?In, +Yield, ?Await]

ArrowFunction[In, Yield, Await] :
    ArrowParameters[?Yield, ?Await] [no LineTerminator here] => ConciseBody[?In]

AsyncFunctionDeclaration[Yield, Await, Default] :
    async [no LineTerminator here] function BindingIdentifier[?Yield, ?Await] ( FormalParameters[~Yield, +Await] ) { AsyncFunctionBody }
    [+Default] async [no LineTerminator here] function ( FormalParameters[~Yield, +Await] ) { AsyncFunctionBody }

AsyncFunctionExpression :
    async [no LineTerminator here] function BindingIdentifier[~Yield, +Await]opt ( FormalParameters[~Yield, +Await] ) { AsyncFunctionBody }

AsyncMethod[Yield, Await] :
    async [no LineTerminator here] ClassElementName[?Yield, ?Await] ( UniqueFormalParameters[~Yield, +Await] ) { AsyncFunctionBody }

AsyncGeneratorDeclaration[Yield, Await, Default] :
    async [no LineTerminator here] function * BindingIdentifier[?Yield, ?Await] ( FormalParameters[+Yield, +Await] ) { AsyncGeneratorBody }
    [+Default] async [no LineTerminator here] function * ( FormalParameters[+Yield, +Await] ) { AsyncGeneratorBody }

AsyncGeneratorExpression :
    async [no LineTerminator here] function * BindingIdentifier[+Yield, +Await]opt ( FormalParameters[+Yield, +Await] ) { AsyncGeneratorBody }

AsyncGeneratorMethod[Yield, Await] :
    async [no LineTerminator here] * ClassElementName[?Yield, ?Await] ( UniqueFormalParameters[+Yield, +Await] ) {