Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Commit7cd082f

Browse files
committed
Clarify that surrogate pairs are not encoded in UTF-8 directly
1 parentc5d94a3 commit7cd082f

File tree

1 file changed

+28
-21
lines changed

1 file changed

+28
-21
lines changed

‎doc/src/sgml/syntax.sgml

Lines changed: 28 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
<!-- $PostgreSQL: pgsql/doc/src/sgml/syntax.sgml,v 1.154 2010/09/01 18:22:29 tgl Exp $ -->
1+
<!-- $PostgreSQL: pgsql/doc/src/sgml/syntax.sgml,v 1.155 2010/09/07 18:54:09 petere Exp $ -->
22

33
<chapter id="sql-syntax">
44
<title>SQL Syntax</title>
@@ -236,12 +236,15 @@ U&amp;"d!0061t!+000061" UESCAPE '!'
236236

237237
<para>
238238
The Unicode escape syntax works only when the server encoding is
239-
UTF8. When other server encodings are used, only code points in
240-
the ASCII range (up to <literal>\007F</literal>) can be specified.
241-
Both the 4-digit and the 6-digit form can be used to specify
242-
UTF-16 surrogate pairs to compose characters with code points
243-
larger than U+FFFF (although the availability of
244-
the 6-digit form technically makes this unnecessary).
239+
<literal>UTF8</>. When other server encodings are used, only code
240+
points in the ASCII range (up to <literal>\007F</literal>) can be
241+
specified. Both the 4-digit and the 6-digit form can be used to
242+
specify UTF-16 surrogate pairs to compose characters with code
243+
points larger than U+FFFF, although the availability of the
244+
6-digit form technically makes this unnecessary. (When surrogate
245+
pairs are used when the server encoding is <literal>UTF8</>, they
246+
are first combined into a single code point that is then encoded
247+
in UTF-8.)
245248
</para>
246249

247250
<para>
@@ -431,13 +434,15 @@ SELECT 'foo' 'bar';
431434

432435
<para>
433436
The Unicode escape syntax works fully only when the server
434-
encoding is UTF-8. When other server encodings are used, only
435-
code points in the ASCII range (up to <literal>\u007F</>) can be
436-
specified. Both the 4-digit and the 8-digit form can be used to
437-
specify UTF-16 surrogate pairs to compose characters with code
438-
points larger than U+FFFF (although the
439-
availability of the 8-digit form technically makes this
440-
unnecessary).
437+
encoding is <literal>UTF8</>. When other server encodings are
438+
used, only code points in the ASCII range (up
439+
to <literal>\u007F</>) can be specified. Both the 4-digit and
440+
the 8-digit form can be used to specify UTF-16 surrogate pairs to
441+
compose characters with code points larger than U+FFFF, although
442+
the availability of the 8-digit form technically makes this
443+
unnecessary. (When surrogate pairs are used when the server
444+
encoding is <literal>UTF8</>, they are first combined into a
445+
single code point that is then encoded in UTF-8.)
441446
</para>
442447

443448
<caution>
@@ -517,13 +522,15 @@ U&amp;'d!0061t!+000061' UESCAPE '!'
517522

518523
<para>
519524
The Unicode escape syntax works only when the server encoding is
520-
UTF8. When other server encodings are used, only code points in
521-
the ASCII range (up to <literal>\007F</literal>) can be
522-
specified.
523-
Both the 4-digit and the 6-digit form can be used to specify
524-
UTF-16 surrogate pairs to compose characters with code points
525-
larger than U+FFFF (although the availability
526-
of the 6-digit form technically makes this unnecessary).
525+
<literal>UTF8</>. When other server encodings are used, only
526+
code points in the ASCII range (up to <literal>\007F</literal>)
527+
can be specified. Both the 4-digit and the 6-digit form can be
528+
used to specify UTF-16 surrogate pairs to compose characters with
529+
code points larger than U+FFFF, although the availability of the
530+
6-digit form technically makes this unnecessary. (When surrogate
531+
pairs are used when the server encoding is <literal>UTF8</>, they
532+
are first combined into a single code point that is then encoded
533+
in UTF-8.)
527534
</para>
528535

529536
<para>

0 commit comments

Comments
 (0)

[8]ページ先頭

©2009-2025 Movatter.jp