Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Commitbb904e0

Browse files
authored
closesgh-124016: update Unicode to 16.0.0 (#124017)
1 parenta9594a3 commitbb904e0

File tree

12 files changed

+22581
-20691
lines changed

12 files changed

+22581
-20691
lines changed

‎Doc/library/stdtypes.rst‎

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1679,7 +1679,7 @@ expression support in the :mod:`re` module).
16791679

16801680
The casefolding algorithm is
16811681
`described in section 3.13 'Default Case Folding' of the Unicode Standard
1682-
<https://www.unicode.org/versions/Unicode15.1.0/ch03.pdf>`__.
1682+
<https://www.unicode.org/versions/Unicode16.0.0/core-spec/chapter-3/#G33992>`__.
16831683

16841684
..versionadded::3.3
16851685

@@ -1843,7 +1843,7 @@ expression support in the :mod:`re` module).
18431843
property being one of "Lm", "Lt", "Lu", "Ll", or "Lo". Note that this is different
18441844
from the `Alphabetic property defined in the section 4.10 'Letters, Alphabetic, and
18451845
Ideographic' of the Unicode Standard
1846-
<https://www.unicode.org/versions/Unicode15.1.0/ch04.pdf>`_.
1846+
<https://www.unicode.org/versions/Unicode16.0.0/core-spec/chapter-4/#G91002>`_.
18471847

18481848

18491849
..method::str.isascii()
@@ -1979,7 +1979,7 @@ expression support in the :mod:`re` module).
19791979

19801980
The lowercasing algorithm used is
19811981
`described in section 3.13 'Default Case Folding' of the Unicode Standard
1982-
<https://www.unicode.org/versions/Unicode15.1.0/ch03.pdf>`__.
1982+
<https://www.unicode.org/versions/Unicode16.0.0/core-spec/chapter-3/#G33992>`__.
19831983

19841984

19851985
..method::str.lstrip([chars])
@@ -2331,7 +2331,7 @@ expression support in the :mod:`re` module).
23312331

23322332
The uppercasing algorithm used is
23332333
`described in section 3.13 'Default Case Folding' of the Unicode Standard
2334-
<https://www.unicode.org/versions/Unicode15.1.0/ch03.pdf>`__.
2334+
<https://www.unicode.org/versions/Unicode16.0.0/core-spec/chapter-3/#G33992>`__.
23352335

23362336

23372337
..method::str.zfill(width)

‎Doc/library/unicodedata.rst‎

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,8 @@
1717

1818
This module provides access to the Unicode Character Database (UCD) which
1919
defines character properties for all Unicode characters. The data contained in
20-
this database is compiled from the `UCD version15.1.0
21-
<https://www.unicode.org/Public/15.1.0/ucd>`_.
20+
this database is compiled from the `UCD version16.0.0
21+
<https://www.unicode.org/Public/16.0.0/ucd>`_.
2222

2323
The module uses the same names and symbols as defined by Unicode
2424
Standard Annex #44, `"Unicode Character Database"
@@ -175,6 +175,6 @@ Examples:
175175

176176
..rubric::Footnotes
177177

178-
.. [#]https://www.unicode.org/Public/15.1.0/ucd/NameAliases.txt
178+
.. [#]https://www.unicode.org/Public/16.0.0/ucd/NameAliases.txt
179179
180-
.. [#]https://www.unicode.org/Public/15.1.0/ucd/NamedSequences.txt
180+
.. [#]https://www.unicode.org/Public/16.0.0/ucd/NamedSequences.txt

‎Doc/reference/lexical_analysis.rst‎

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -314,16 +314,16 @@ The Unicode category codes mentioned above stand for:
314314
* *Nd* - decimal numbers
315315
* *Pc* - connector punctuations
316316
* *Other_ID_Start* - explicit list of characters in `PropList.txt
317-
<https://www.unicode.org/Public/15.1.0/ucd/PropList.txt>`_ to support backwards
317+
<https://www.unicode.org/Public/16.0.0/ucd/PropList.txt>`_ to support backwards
318318
compatibility
319319
* *Other_ID_Continue* - likewise
320320

321321
All identifiers are converted into the normal form NFKC while parsing; comparison
322322
of identifiers is based on NFKC.
323323

324324
A non-normative HTML file listing all valid identifier characters for Unicode
325-
15.1.0 can be found at
326-
https://www.unicode.org/Public/15.1.0/ucd/DerivedCoreProperties.txt
325+
16.0.0 can be found at
326+
https://www.unicode.org/Public/16.0.0/ucd/DerivedCoreProperties.txt
327327

328328

329329
.. _keywords:
@@ -1044,4 +1044,4 @@ occurrence outside string literals and comments is an unconditional error:
10441044
10451045
..rubric::Footnotes
10461046

1047-
.. [#]https://www.unicode.org/Public/15.1.0/ucd/NameAliases.txt
1047+
.. [#]https://www.unicode.org/Public/16.0.0/ucd/NameAliases.txt

‎Doc/whatsnew/3.14.rst‎

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -253,6 +253,11 @@ symtable
253253

254254
(Contributed by Bénédikt Tran in:gh:`120029`.)
255255

256+
unicodedata
257+
-----------
258+
259+
* The Unicode database has been updated to Unicode 16.0.0.
260+
256261
.. Add improved modules above alphabetically, not here at the end.
257262
258263
Optimizations

‎Lib/test/string_tests.py‎

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1132,8 +1132,8 @@ def test_capitalize_nonascii(self):
11321132
self.checkequal('\u2160\u2171\u2172',
11331133
'\u2170\u2171\u2172','capitalize')
11341134
# check with Ll chars with no upper - nothing changes here
1135-
self.checkequal('\u019b\u1d00\u1d86\u0221\u1fb7',
1136-
'\u019b\u1d00\u1d86\u0221\u1fb7','capitalize')
1135+
self.checkequal('\u1d00\u1d86\u0221\u1fb7',
1136+
'\u1d00\u1d86\u0221\u1fb7','capitalize')
11371137

11381138
deftest_startswith(self):
11391139
self.checkequal(True,'hello','startswith','he')

‎Lib/test/test_str.py‎

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2430,8 +2430,10 @@ def __repr__(self):
24302430
self.assertEqual(repr(s1()),'\\n')
24312431

24322432
deftest_printable_repr(self):
2433-
self.assertEqual(repr('\U00010000'),"'%c'"% (0x10000,))# printable
2434-
self.assertEqual(repr('\U00014000'),"'\\U00014000'")# nonprintable
2433+
# printable
2434+
self.assertEqual(repr('\U00010000'),"'%c'"% (0x10000,))
2435+
# nonprintable (private use area)
2436+
self.assertEqual(repr('\U00100001'),"'\\U00100001'")
24352437

24362438
# This test only affects 32-bit platforms because expandtabs can only take
24372439
# an int as the max value, not a 64-bit C long. If expandtabs is changed

‎Lib/test/test_unicodedata.py‎

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@
1818
classUnicodeMethodsTest(unittest.TestCase):
1919

2020
# update this, if the database changes
21-
expectedchecksum='63aa77dcb36b0e1df082ee2a6071caeda7f0955e'
21+
expectedchecksum='9e43ee3929471739680c0e705482b4ae1c4122e4'
2222

2323
@requires_resource('cpu')
2424
deftest_method_checksum(self):
@@ -71,7 +71,7 @@ class UnicodeFunctionsTest(UnicodeDatabaseTest):
7171

7272
# Update this if the database changes. Make sure to do a full rebuild
7373
# (e.g. 'make distclean && make') to get the correct checksum.
74-
expectedchecksum='232affd2a50ec4bd69d2482aa0291385cbdefaba'
74+
expectedchecksum='23ab09ed4abdf93db23b97359108ed630dd8311d'
7575

7676
@requires_resource('cpu')
7777
deftest_function_checksum(self):
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
Update:mod:`unicodedata` database to Unicode 16.0.0.

0 commit comments

Comments
 (0)

[8]ページ先頭

©2009-2025 Movatter.jp