Movatterモバイル変換


[0]ホーム

URL:


RFC 9233IDNA2008 and Unicode 12March 2022
FältströmStandards Track[Page]
Stream:
Internet Engineering Task Force (IETF)
RFC:
9233
Category:
Standards Track
Published:
ISSN:
2070-1721
Author:
P. Fältström
Netnod

RFC 9233

Internationalized Domain Names for Applications 2008 (IDNA2008) and Unicode 12.0.0

Abstract

This document describes the changes between Unicode 6.0.0 and Unicode 12.0.0 in the context of the current version of Internationalized Domain Names for Applications 2008 (IDNA2008). Some additions and changes have been made in the Unicode Standard that affect the values produced by the algorithm IDNA2008 specifies. IDNA2008 allows adding exceptions to the algorithm for backward compatibility; however, this document does not add any such exceptions. This document provides the necessary tables to IANA to make its database consistent with Unicode 12.0.0.

To improve understanding, this document describes systems that are being used as alternatives to those that conform to IDNA2008.

Status of This Memo

This is an Internet Standards Track document.

This document is a product of the Internet Engineering Task Force (IETF). It represents the consensus of the IETF community. It has received public review and has been approved for publication by the Internet Engineering Steering Group (IESG). Further information on Internet Standards is available in Section 2 of RFC 7841.

Information about the current status of this document, any errata, and how to provide feedback on it may be obtained athttps://www.rfc-editor.org/info/rfc9233.

Copyright Notice

Copyright (c) 2022 IETF Trust and the persons identified as the document authors. All rights reserved.

This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License.

Table of Contents

1.Introduction

The current version of Internationalized Domain Names for Applications (IDNA) was initiated in 2008, and despite not being completed until 2010, is widely known as "IDNA2008". It is specified in the series of documents listed inSection 2.1. The IDNA2008 standard includes an algorithm by which a derived property value is calculated based on the properties defined in the Unicode Standard.

The derived property values that can be calculated are defined inRFC 5892 [RFC5892]. Below is a summary to aid in the reading of this document. For definition of the terms, please seeRFC 5892 [RFC5892].

PROTOCOL VALID:
Those that are allowed to be used in IDNs. Code points with this property value are permitted for general use in IDNs. However, the fact that a label consists only of code points with this property value does not imply that the label can be used in DNS. The abbreviated term PVALID is used to refer to this value.
CONTEXTUAL RULE REQUIRED:
Some characteristics of the character, such as it being invisible in certain contexts or problematic in others, require that it not be used in labels unless specific other characters or properties are present. The abbreviated term CONTEXT is used to refer to this value. As explained inRFC 5892 [RFC5892], CONTEXT is in turn divided into CONTEXTJ and CONTEXTO.
DISALLOWED:
Those that should clearly not be included in IDNs. Code points with this property value are not permitted in IDNs.
UNASSIGNED:
Those code points that are not designated (i.e., are unassigned) in the Unicode Standard.

When the Unicode Standard is updated, new code points are assigned and already assigned code points can have their property values changed.

There were three incompatible changes in the Unicode Standard betweenUnicode 5.2.0 [Unicode-5.2.0] andUnicode 6.0.0 [Unicode-6.0.0]; they are described inRFC 6452 [RFC6452]. The code points U+0CF1 and U+0CF2 had a derived property value change from DISALLOWED to PVALID, and the code point U+19DA had a change in derived property value from PVALID to DISALLOWED. These changes where examined in great detail, but the IETF concluded that these changes to the Unicode Standard did not warrant an update toRFC 5892 [RFC5892].

As described inSection 3, more incompatible changes have been made to code points between Unicode 6.0.0 andUnicode 12.0.0 [Unicode-12.0.0]; however, the changes in the derived property values do not result in exceptions (as defined in Section2.6 ofRFC 5892 [RFC5892]) that would require an update to the "IDNA Contextual Rules" registry (which would also be considered an update toRFC 5892 [RFC5892]).

Further, in 2015, the Internet Architecture Board (IAB) issueda statement [IAB2005-1] that advised the community to avoid using any of the potentially problematic code points and asked the IETF to resolve the issues related to the code point ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1) that was introduced inUnicode 7.0.0 [Unicode-7.0.0]. In February of that year, the statement wasrevised [IAB2005-2] to focus on the latter request. More details about the problem of code point sequences not normalizing as one might expect appear ina draft that was part of the discussion [IDNA7].

The result of the work in the IETF was that no exception was added toRFC 5892 [RFC5892]; however, it should be noted that the review of the issues around U+08A1 indicated that this code point is not an isolated case and that a number of long-standing PVALID code points may have similar issues. While the affected code points remain PVALID in this document, identification of the problem resulted in a clarification of the review process for new Unicode versions. That clarification, which reinforces the original review plan to capture issues like these, was published asRFC 8753 [RFC8753]. Any review of Unicode versions after 12.0.0 should be made according toRFC 8753 [RFC8753]; an objective of this document is to ensure that a proper review of such versions after version 12.0.0 can be made.

2.Background

2.1.IDNA2008 Documents

IDNA2008 consists of the following documents. The documents in the set have informal names.

2.2.Additional Important IDNA2008-Related Documents

There are other documents important for the understanding and functioning of IDNA2008, for example this.

2.3.Deployment

There are many variations on the general IDNA model in use in the various parts of the community. The following lists some of the strategies that implementations that claim to be IDNA compliant are known to use, but it should be noted the list is not complete:

  • IDNA2003 as specified inRFC 3490 [RFC3490] andRFC 3491 [RFC3491]. Those specifications are dependent on case folding, Normalization Form KC (NFKC), and on tables that specify for each code point whether it is allowed to be used or not, with a distinction made between use for "stored strings" and "query strings". The tables themselves are dependent onUnicode 3.2 [Unicode-3.2.0].
  • A number of variations on IDNA2003, sometimes presented as "updated IDNA2003" or the like, which follow the principles of IDNA2003 as understood by the implementers but that use tables that represent how the implementers believeStringprep [RFC3454] andNameprep [RFC3491] would have evolved had the IETF not moved in the direction of IDNA2008 instead.
  • A mix between IDNA2003 and IDNA2008 where code points assigned to Unicode afterUnicode 3.2.0 [Unicode-3.2.0] have derived property value calculated according to the algorithm specified in IDNA2008.
  • A mix between IDNA2003 and IDNA2008 according to theUnicode Technical Standard #46 [UTS-46]. Because that document specifies different profiles, there are several variations that leave users with no guarantee that two applications claiming conformance to UTS#46 will interoperate well with each other much less with conforming IDNA2008 implementations. UTS#46 is ultimately based on a normative table very much like the one used byStringprep [RFC3454] but updated for each new version of Unicode.
  • The (normative) IDNA2008 algorithm applied to whatever version of Unicode Standard exists in the operating system and/or libraries used, independent of whatever version of tables appears in the (non-normative) IANA database.

In practice, the Unicode Consortium creates a maximum set of code points by assigning code points in the Unicode Standard. The IDNA2008 rules use the Unicode Standard to create a further subset of code points and context that are permitted in DNS labels associated with its PVALID and CONTEXT (CONTEXTJ or CONTEXTO) derived property values. DNS registries and other organizations that deal with IDNs are supposed to create their own subsets from IDNA2008 for use by those registries and organizations.

This progressive subsetting and narrowing of the repertoire of code points that can be used in labels is an implementation of the principles of being conservative when deciding what code points to include in such a subset.SAC-084 [SAC-084] andRFC 6912 [RFC6912] recommend to DNS registries and other organizations to be conservative when creating their subsets and to use the principle of creating subsets by inclusion.

See alsoSecurity Considerations (Section 7) in this document.

3.Notable Changes between Unicode 6.0.0 and 12.0.0

Among the changes between the Unicode versions, most code points that change derived property value change from UNASSIGNED to PVALID or from UNASSIGNED to DISALLOWED. The interesting changes in derived property values include other changes. All changes between the major versions of Unicode can be found inAppendix A (6.0.0-7.0.0),Appendix B (7.0.0-8.0.0),Appendix C (8.0.0-9.0.0),Appendix D (9.0.0-10.0.0),Appendix E (10.0.0-11.0.0), andAppendix F (11.0.0-12.0.0).

3.1.Changes between Unicode 6.0.0 and 7.0.0

Change in number of characters in each category:

  • PVALID changed from 97418 to 99867 (+2449)
  • UNASSIGNED changed from 865081 to 861509 (-3572)
  • CONTEXTJ did not change, at 2
  • CONTEXTO did not change, at 25
  • DISALLOWED changed from 151586 to 152709 (+1123)
  • TOTAL did not change, at 1114112

There are no changes made to Unicode between version 6.0.0 and 7.0.0 that impact IDNA2008 calculation of the derived property values.

The code points U+17B4 KHMER VOWEL INHERENT AQ and U+17B5 KHMER VOWEL INHERENT AA both changed the General Category from Cf (Format) to Mn (Nonspacing_Mark), but that did not impact the calculation of the derived property value which stayed at DISALLOWED.

The character ARABIC LETTER BEH WITH HAMZA ABOVE (U+08A1) was introduced in Unicode 7.0.0. This was discussed extensively in the IETF and also by the IAB intheir statement [IAB2005-1] requesting the IETF to investigate the issue. Specifically, the IAB stated:

On the same precautionary principle, the IAB recommends that the Internationalized Domain Names for Applications (IDNA) Parameters registry<https://www.iana.org/assignments/idna-tables/> not be updated to Unicode 7.0.0 until the IETF has consensus on a solution to this problem.

The discussion in the IETF concluded that although it is possible to create "the same" character in multiple ways, the issue with U+08A1 is not unique. The character U+08A1 (ARABIC LETTER BEH WITH HAMZA ABOVE) can be represented with the sequence ARABIC LETTER BEH (U+0628) and ARABIC HAMZA ABOVE (U+0654). This is identical to LATIN SMALL LETTER O WITH STROKE (U+00F8), which can be represented with the sequence LATIN SMALL LETTER O (U+006F) followed by COMBINING SHORT SOLIDUS OVERLAY (U+0337).

Although the discussion about this specific code point resulted in acceptance of the derived property value of PVALID, the underlying problem with combining sequences is not understood fully. Therefore, it cannot be claimed that this case can be extrapolated to other situations and other code points.

3.2.Changes between Unicode 7.0.0 and 10.0.0

Change in number of characters in each category:

  • Code points that changed derived property value: 0
  • PVALID changed from 99867 to 122411 (+22544)
  • UNASSIGNED changed from 861509 to 837775 (-23734)
  • CONTEXTJ did not change, at 2
  • CONTEXTO did not change, at 25
  • DISALLOWED changed from 152709 to 153899 (+1190)
  • TOTAL did not change, at 1114112

There are no changes made to Unicode between version 7.0.0 and 10.0.0 that impact IDNA2008 calculation of the derived property values.

3.3.Changes between Unicode 10.0.0 and 11.0.0

Change in number of characters in each category:

  • Code points that changed derived property value: 1
  • PVALID changed from 122411 to 122734 (+323)
  • UNASSIGNED changed from 837775 to 837091 (-684)
  • CONTEXTJ did not change, at 2
  • CONTEXTO did not change, at 25
  • DISALLOWED changed from 153899 to 154260 (+361)
  • TOTAL did not change, at 1114112
  • Georgian letters in the ranges U+10D0..U+10FA and U+10FD..U+10FF had their General Category changed from Lo (Other_Letter) to Ll (Lowercase_Letter) to reflect their status as the lowercase of new Georgian case pairs. Case mappings were also added.
  • SHARADA SANDHI MARK (U+111C9) General Category was changed from Po (Other_Punctuation) to Mn (Nonspacing_Mark), and the Bidi property was changed from L (Left to Right) to NSM (Nonspacing Mark).
  • The properties for ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) and ZANABAZAR SQUARE VOWEL SIGN AU (U+11A08) were corrected from Mc to Mn.
  • SPHERICAL ANGLE OPENING UP (U+29A1) was changed to Bidi Mirrored to No.

These changes to the Unicode Standard have the following implications for these code points:

  • The newly assigned 684 characters are assigned a derived property value as of a result of applying the IDNA2008 algorithm.
  • The Georgian letters in the ranges U+10D0..U+10FA and U+10FD..U+10FF existed before IDNA2008 was created. Applying the IDNA2008 algorithm to the code points assigned the derived property value PVALID, and that value is unchanged even if the underlying Unicode properties have changed. The newly encoded Mtavruli letters have General Category Lu (Uppercase_Letter) and are therefore DISALLOWED.
  • The U+111C9 SHARADA SANDHI MARK was added toUnicode 8.0.0 [Unicode-8.0.0]. Applying the IDNA2008 algorithm to the code point assigned the derived property value DISALLOWED. The changes in the underlying properties inUnicode 11.0.0 [Unicode-11.0.0] caused the derived property value to change to PVALID.
  • The characters ZANABAZAR SQUARE VOWEL SIGN AI (U+11A07) and ZANABAZAR SQUARE VOWEL SIGN AU (U+11A08) were added toUnicode 10.0.0 [Unicode-10.0.0]. Applying the IDNA2008 algorithm to the code points assigned the derived property value PVALID, and that value is unchanged even if the underlying Unicode properties have changed.
  • SPHERICAL ANGLE OPENING UP (U+29A1) existed before IDNA2008 was created. Applying the IDNA2008 algorithm to the code point assigned the derived property value DISALLOWED, and that value is unchanged even if the underlying Unicode properties have changed.

3.4.Changes between Unicode 11.0.0 and 12.0.0

Change in number of characters in each category:

  • Code points that changed derived property value: 0
  • PVALID changed from 122734 to 123006 (+272)
  • UNASSIGNED changed from 837091 to 836537 (-554)
  • CONTEXTJ did not change, at 2
  • CONTEXTO did not change, at 25
  • DISALLOWED changed from 154260 to 154542 (+282)
  • TOTAL did not change, at 1114112

4.U+111C9 SHARADA SANDHI MARK

As one can see inSection 3, an incompatible property change was made between Unicode 6.0.0 and 12.0.0, affecting the code point U+111C9. Its derived property value thus changed from DISALLOWED to PVALID. In situations like these, IDNA2008 allows for addition of rules toRFC 5892 [RFC5892], Section2.7. If the code point is accepted, it might still be rejected if validated by software based on versions of Unicode older than 12.0.0. As the character is rarely used outside the group of Sharada specialists but is used in some records for indicating sandhi breaks, the conclusion was that it could either be added as an exception or allowed to change its property value. As including an exception would require implementation changes to deployments of IDNA20008, the IETF has decided not to add a BackwardCompatible rule to IDNA2008 (i.e., Section2.7 ofRFC 5892 [RFC5892]) for this code point. This also ensures all sandhi marks are treated equally.

5.Conclusion

As described in Sections3 and4, changes have been made to Unicode between version 6.0.0 and 12.0.0. Some changes to specific characters changed their derived property value, whereas other changes did not. Given the deployment considerations described inSection 2.3 and changes in the Unicode Standard described in Sections3 and4, including implications to normalization, the conclusion is not to add any exception rules to IDNA2008.

This document addresses only changes to Unicode between version 6.0.0 and version 12.0.0. Changes in future Unicode versions might result in the conclusion that exception rules need to be added to IDNA2008 after the review process explained inRFC 8753 [RFC8753]. Separately from any changes in Unicode, the IETF might conclude that updates toRFC 5892 [RFC5892] or other IDNA2008 documents might become necessary; such updates might include changes to the algorithm specified in IDNA2008 as well as additional rules, categories, or other forms of tuning, like the clarifications inRFC 8753 [RFC8753].

6.IANA Considerations

IANA updated the"IDNA Rules and Derived Property Values" [IANA-IDNA] registry after the expert reviewer validated that the derived property values were calculated correctly.

7.Security Considerations

This document makes recommendations regarding the use of the IDNA2008 algorithm for calculation of derived property values, based on Unicode version 12.0.0. This recommendation does not say anything about what recommendations to make for future versions of the Unicode Standard.

Not following these recommendations can lead to various security issues. Specifically, allowing confusable characters may lead to various phishing attacks, as described in the Security Consideration Sections in the documents listed inSection 2.1.

8.References

8.1.Normative References

[RFC3491]
Hoffman, P. andM. Blanchet,"Nameprep: A Stringprep Profile for Internationalized Domain Names (IDN)",RFC 3491,DOI 10.17487/RFC3491,,<https://www.rfc-editor.org/info/rfc3491>.
[RFC5890]
Klensin, J.,"Internationalized Domain Names for Applications (IDNA): Definitions and Document Framework",RFC 5890,DOI 10.17487/RFC5890,,<https://www.rfc-editor.org/info/rfc5890>.
[RFC5891]
Klensin, J.,"Internationalized Domain Names in Applications (IDNA): Protocol",RFC 5891,DOI 10.17487/RFC5891,,<https://www.rfc-editor.org/info/rfc5891>.
[RFC5892]
Faltstrom, P., Ed.,"The Unicode Code Points and Internationalized Domain Names for Applications (IDNA)",RFC 5892,DOI 10.17487/RFC5892,,<https://www.rfc-editor.org/info/rfc5892>.
[RFC5893]
Alvestrand, H., Ed. andC. Karp,"Right-to-Left Scripts for Internationalized Domain Names for Applications (IDNA)",RFC 5893,DOI 10.17487/RFC5893,,<https://www.rfc-editor.org/info/rfc5893>.
[RFC6452]
Faltstrom, P., Ed. andP. Hoffman, Ed.,"The Unicode Code Points and Internationalized Domain Names for Applications (IDNA) - Unicode 6.0",RFC 6452,DOI 10.17487/RFC6452,,<https://www.rfc-editor.org/info/rfc6452>.

8.2.Informative References

[IAB2005-1]
Internet Architecture Board,"IAB Statement on Identifiers and Unicode 7.0.0",,<https://www.iab.org/documents/correspondence-reports-documents/2015-2/iab-statement-on-identifiers-and-unicode-7-0-0/archive/>.
[IAB2005-2]
Internet Architecture Board,"IAB Statement on Identifiers and Unicode 7.0.0",,<https://www.iab.org/documents/correspondence-reports-documents/2015-2/iab-statement-on-identifiers-and-unicode-7-0-0/>.
[IANA-IDNA]
IANA,"IDNA Rules and Derived Property Values",,<https://www.iana.org/assignments/idna-tables-12.0.0/>.
[IDNA7]
Klensin, J. C. andP. Faltstrom,"IDNA Update for Unicode 7.0 and Later Versions",Work in Progress,Internet-Draft, draft-klensin-idna-5892upd-unicode70-05,,<https://datatracker.ietf.org/doc/html/draft-klensin-idna-5892upd-unicode70-05>.
[RFC3454]
Hoffman, P. andM. Blanchet,"Preparation of Internationalized Strings ("stringprep")",RFC 3454,DOI 10.17487/RFC3454,,<https://www.rfc-editor.org/info/rfc3454>.
[RFC3490]
Faltstrom, P.,Hoffman, P., andA. Costello,"Internationalizing Domain Names in Applications (IDNA)",RFC 3490,DOI 10.17487/RFC3490,,<https://www.rfc-editor.org/info/rfc3490>.
[RFC5894]
Klensin, J.,"Internationalized Domain Names for Applications (IDNA): Background, Explanation, and Rationale",RFC 5894,DOI 10.17487/RFC5894,,<https://www.rfc-editor.org/info/rfc5894>.
[RFC5895]
Resnick, P. andP. Hoffman,"Mapping Characters for Internationalized Domain Names in Applications (IDNA) 2008",RFC 5895,DOI 10.17487/RFC5895,,<https://www.rfc-editor.org/info/rfc5895>.
[RFC6912]
Sullivan, A.,Thaler, D.,Klensin, J., andO. Kolkman,"Principles for Unicode Code Point Inclusion in Labels in the DNS",RFC 6912,DOI 10.17487/RFC6912,,<https://www.rfc-editor.org/info/rfc6912>.
[RFC8753]
Klensin, J. andP. Fältström,"Internationalized Domain Names for Applications (IDNA) Review for New Unicode Versions",RFC 8753,DOI 10.17487/RFC8753,,<https://www.rfc-editor.org/info/rfc8753>.
[SAC-084]
The Security and Stability Advisory Committee,"SAC084",SSAC Comments on Guidelines for the Extended Process Similarity Review Panel for the IDN ccTLD Fast Track Process,,<https://www.icann.org/en/system/files/files/sac-084-en.pdf>.
[Unicode-3.2.0]
The Unicode Consortium,"The Unicode Standard, Version 3.2.0",Mountain View: The Unicode Consortium,ISBN 0-201-61633-5,,<https://www.unicode.org/versions/Unicode3.2.0/>.
[Unicode-5.2.0]
The Unicode Consortium,"The Unicode Standard, Version 5.2.0",Mountain View: The Unicode Consortium,ISBN 978-1-936213-00-9,,<https://www.unicode.org/versions/Unicode5.2.0/>.
[Unicode-6.0.0]
The Unicode Consortium,"The Unicode Standard, Version 6.0.0",Mountain View: The Unicode Consortium,ISBN 978-1-936213-01-6,,<https://www.unicode.org/versions/Unicode6.0.0/>.
[Unicode-7.0.0]
The Unicode Consortium,"The Unicode Standard, Version 7.0.0",Mountain View: The Unicode Consortium,ISBN 978-1-936213-09-2,,<https://www.unicode.org/versions/Unicode7.0.0/>.
[Unicode-8.0.0]
The Unicode Consortium,"The Unicode Standard, Version 8.0.0",Mountain View: The Unicode Consortium,ISBN 978-1-936213-10-8,,<https://www.unicode.org/versions/Unicode8.0.0/>.
[Unicode-10.0.0]
The Unicode Consortium,"The Unicode Standard, Version 10.0.0",Mountain View: The Unicode Consortium,ISBN 978-1-936213-16-0,,<https://www.unicode.org/versions/Unicode10.0.0/>.
[Unicode-11.0.0]
The Unicode Consortium,"The Unicode Standard, Version 11.0.0",Mountain View: The Unicode Consortium,ISBN 978-1-936213-19-1,,<https://www.unicode.org/versions/Unicode11.0.0/>.
[Unicode-12.0.0]
The Unicode Consortium,"The Unicode Standard, Version 12.0.0",Mountain View: The Unicode Consortium,ISBN 978-1-936213-22-1,,<https://www.unicode.org/versions/Unicode12.0.0/>.
[UTS-46]
The Unicode Consortium,"Unicode Technical Standard #46, Version 12.0.0",UNICODE IDNA COMPATIBILITY PROCESSING,,<https://www.unicode.org/reports/tr46/tr46-23.html>.

Appendix A.Changes from Unicode 6.0.0 to Unicode 7.0.0

Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED.

037F        ; DISALLOWED # GREEK CAPITAL LETTER YOT0528        ; DISALLOWED # CYRILLIC CAPITAL LETTER EN WITH LEFT HOOK0529        ; PVALID     # CYRILLIC SMALL LETTER EN WITH LEFT HOOK052A        ; DISALLOWED # CYRILLIC CAPITAL LETTER DZZHE052B        ; PVALID     # CYRILLIC SMALL LETTER DZZHE052C        ; DISALLOWED # CYRILLIC CAPITAL LETTER DCHE052D        ; PVALID     # CYRILLIC SMALL LETTER DCHE052E        ; DISALLOWED # CYRILLIC CAPITAL LETTER EL WITH DESCENDER052F        ; PVALID     # CYRILLIC SMALL LETTER EL WITH DESCENDER058D..058F  ; DISALLOWED # RIGHT-FACING ARMENIAN ETERNITY SIGN..ARMENIAN0604..0605  ; DISALLOWED # ARABIC SIGN SAMVAT..ARABIC NUMBER MARK ABOVE061C        ; DISALLOWED # ARABIC LETTER MARK08A0..08B2  ; PVALID     # ARABIC LETTER BEH WITH SMALL V BELOW..ARABIC08E4..08FF  ; PVALID     # ARABIC CURLY FATHA..ARABIC MARK SIDEWAYS NOON0978        ; PVALID     # DEVANAGARI LETTER MARWARI DDA0980        ; PVALID     # BENGALI ANJI0AF0        ; DISALLOWED # GUJARATI ABBREVIATION SIGN0C00        ; PVALID     # TELUGU SIGN COMBINING CANDRABINDU ABOVE0C34        ; PVALID     # TELUGU LETTER LLLA0C81        ; PVALID     # KANNADA SIGN CANDRABINDU0D01        ; PVALID     # MALAYALAM SIGN CANDRABINDU0DE6..0DEF  ; PVALID     # SINHALA LITH DIGIT ZERO..SINHALA LITH DIGIT N0EDE..0EDF  ; PVALID     # LAO LETTER KHMU GO..LAO LETTER KHMU NYO10C7        ; DISALLOWED # GEORGIAN CAPITAL LETTER YN10CD        ; DISALLOWED # GEORGIAN CAPITAL LETTER AEN10FD..10FF  ; PVALID     # GEORGIAN LETTER AEN..GEORGIAN LETTER LABIAL S16F1..16F8  ; PVALID     # RUNIC LETTER K..RUNIC LETTER FRANKS CASKET AE17B4..17B5  ; DISALLOWED # KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT191D..191E  ; PVALID     # LIMBU LETTER GYAN..LIMBU LETTER TRA1AB0..1ABD  ; PVALID     # COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBININ1ABE        ; DISALLOWED # COMBINING PARENTHESES OVERLAY1BAB..1BAD  ; PVALID     # SUNDANESE SIGN VIRAMA..SUNDANESE CONSONANT SI1BBA..1BBF  ; PVALID     # SUNDANESE AVAGRAHA..SUNDANESE LETTER FINAL M1CC0..1CC7  ; DISALLOWED # SUNDANESE PUNCTUATION BINDU SURYA..SUNDANESE1CF3..1CF6  ; PVALID     # VEDIC SIGN ROTATED ARDHAVISARGA..VEDIC SIGN U1CF8..1CF9  ; PVALID     # VEDIC TONE RING ABOVE..VEDIC TONE DOUBLE RING1DE7..1DF5  ; PVALID     # COMBINING LATIN SMALL LETTER ALPHA..COMBINING2066..2069  ; DISALLOWED # LEFT-TO-RIGHT ISOLATE..POP DIRECTIONAL ISOLAT20BA..20BD  ; DISALLOWED # TURKISH LIRA SIGN..RUBLE SIGN23F4..23FA  ; DISALLOWED # BLACK MEDIUM LEFT-POINTING TRIANGLE..BLACK CI2700        ; DISALLOWED # BLACK SAFETY SCISSORS27CB        ; DISALLOWED # MATHEMATICAL RISING DIAGONAL27CD        ; DISALLOWED # MATHEMATICAL FALLING DIAGONAL2B4D..2B4F  ; DISALLOWED # DOWNWARDS TRIANGLE-HEADED ZIGZAG ARROW..SHORT2B5A..2B73  ; DISALLOWED # SLANTED NORTH ARROW WITH HOOKED HEAD..DOWNWAR2B76..2B95  ; DISALLOWED # NORTH WEST TRIANGLE-HEADED ARROW TO BAR..RIGH2B98..2BB9  ; DISALLOWED # THREE-D TOP-LIGHTED LEFTWARDS EQUILATERAL ARR2BBD..2BC8  ; DISALLOWED # BALLOT BOX WITH LIGHT X..BLACK MEDIUM RIGHT-P2BCA..2BD1  ; DISALLOWED # TOP HALF BLACK CIRCLE..UNCERTAINTY SIGN2CF2        ; DISALLOWED # COPTIC CAPITAL LETTER BOHAIRIC KHEI2CF3        ; PVALID     # COPTIC SMALL LETTER BOHAIRIC KHEI2D27        ; PVALID     # GEORGIAN SMALL LETTER YN2D2D        ; PVALID     # GEORGIAN SMALL LETTER AEN2D66..2D67  ; PVALID     # TIFINAGH LETTER YE..TIFINAGH LETTER YO2E32..2E42  ; DISALLOWED # TURNED COMMA..DOUBLE LOW-REVERSED-9 QUOTATION9FCC        ; PVALID     # <CJK Ideograph>A674..A67B  ; PVALID     # COMBINING CYRILLIC LETTER UKRAINIAN IE..COMBIA698        ; DISALLOWED # CYRILLIC CAPITAL LETTER DOUBLE OA699        ; PVALID     # CYRILLIC SMALL LETTER DOUBLE OA69A        ; DISALLOWED # CYRILLIC CAPITAL LETTER CROSSED OA69B        ; PVALID     # CYRILLIC SMALL LETTER CROSSED OA69C..A69D  ; DISALLOWED # MODIFIER LETTER CYRILLIC HARD SIGN..MODIFIERA69F        ; PVALID     # COMBINING CYRILLIC LETTER IOTIFIED EA792        ; DISALLOWED # LATIN CAPITAL LETTER C WITH BARA793..A795  ; PVALID     # LATIN SMALL LETTER C WITH BAR..LATIN SMALL LEA796        ; DISALLOWED # LATIN CAPITAL LETTER B WITH FLOURISHA797        ; PVALID     # LATIN SMALL LETTER B WITH FLOURISHA798        ; DISALLOWED # LATIN CAPITAL LETTER F WITH STROKEA799        ; PVALID     # LATIN SMALL LETTER F WITH STROKEA79A        ; DISALLOWED # LATIN CAPITAL LETTER VOLAPUK AEA79B        ; PVALID     # LATIN SMALL LETTER VOLAPUK AEA79C        ; DISALLOWED # LATIN CAPITAL LETTER VOLAPUK OEA79D        ; PVALID     # LATIN SMALL LETTER VOLAPUK OEA79E        ; DISALLOWED # LATIN CAPITAL LETTER VOLAPUK UEA79F        ; PVALID     # LATIN SMALL LETTER VOLAPUK UEA7AA..A7AD  ; DISALLOWED # LATIN CAPITAL LETTER H WITH HOOK..LATIN CAPITA7B0..A7B1  ; DISALLOWED # LATIN CAPITAL LETTER TURNED K..LATIN CAPITALA7F7        ; PVALID     # LATIN EPIGRAPHIC LETTER SIDEWAYS IA7F8..A7F9  ; DISALLOWED # MODIFIER LETTER CAPITAL H WITH STROKE..MODIFIA9E0..A9FE  ; PVALID     # MYANMAR LETTER SHAN GHA..MYANMAR LETTER TAI LAA7C..AA7F  ; PVALID     # MYANMAR SIGN TAI LAING TONE-2..MYANMAR LETTERAAE0..AAEF  ; PVALID     # MEETEI MAYEK LETTER E..MEETEI MAYEK VOWEL SIGAAF0..AAF1  ; DISALLOWED # MEETEI MAYEK CHEIKHAN..MEETEI MAYEK AHANG KHUAAF2..AAF6  ; PVALID     # MEETEI MAYEK ANJI..MEETEI MAYEK VIRAMAAB30..AB5A  ; PVALID     # LATIN SMALL LETTER BARRED ALPHA..LATIN SMALLAB5B..AB5F  ; DISALLOWED # MODIFIER BREVE WITH INVERTED BREVE..MODIFIERAB64..AB65  ; PVALID     # LATIN SMALL LETTER INVERTED ALPHA..GREEK LETTFA2E..FA2F  ; DISALLOWED # CJK COMPATIBILITY IDEOGRAPH-FA2E..CJK COMPATIFE27..FE2D  ; PVALID     # COMBINING LIGATURE LEFT HALF BELOW..COMBINING1018B..1018C; DISALLOWED # GREEK ONE QUARTER SIGN..GREEK SINUSOID SIGN101A0       ; DISALLOWED # GREEK SYMBOL TAU RHO102E0       ; PVALID     # COPTIC EPACT THOUSANDS MARK102E1..102FB; DISALLOWED # COPTIC EPACT DIGIT ONE..COPTIC EPACT NUMBER N1031F       ; PVALID     # OLD ITALIC LETTER ESS10350..1037A; PVALID     # OLD PERMIC LETTER AN..COMBINING OLD PERMIC LE10500..10527; PVALID     # ELBASAN LETTER A..ELBASAN LETTER KHE10530..10563; PVALID     # CAUCASIAN ALBANIAN LETTER ALT..CAUCASIAN ALBA1056F       ; DISALLOWED # CAUCASIAN ALBANIAN CITATION MARK10600..10736; PVALID     # LINEAR A SIGN AB001..LINEAR A SIGN A66410740..10755; PVALID     # LINEAR A SIGN A701 A..LINEAR A SIGN A732 JE10760..10767; PVALID     # LINEAR A SIGN A800..LINEAR A SIGN A80710860..10876; PVALID     # PALMYRENE LETTER ALEPH..PALMYRENE LETTER TAW10877..1087F; DISALLOWED # PALMYRENE LEFT-POINTING FLEURON..PALMYRENE NU10880..1089E; PVALID     # NABATAEAN LETTER FINAL ALEPH..NABATAEAN LETTE108A7..108AF; DISALLOWED # NABATAEAN NUMBER ONE..NABATAEAN NUMBER ONE HU10980..109B7; PVALID     # MEROITIC HIEROGLYPHIC LETTER A..MEROITIC CURS109BE..109BF; PVALID     # MEROITIC CURSIVE LOGOGRAM RMT..MEROITIC CURSI10A80..10A9C; PVALID     # OLD NORTH ARABIAN LETTER HEH..OLD NORTH ARABI10A9D..10A9F; DISALLOWED # OLD NORTH ARABIAN NUMBER ONE..OLD NORTH ARABI10AC0..10AC7; PVALID     # MANICHAEAN LETTER ALEPH..MANICHAEAN LETTER WA10AC8       ; DISALLOWED # MANICHAEAN SIGN UD10AC9..10AE6; PVALID     # MANICHAEAN LETTER ZAYIN..MANICHAEAN ABBREVIAT10AEB..10AF6; DISALLOWED # MANICHAEAN NUMBER ONE..MANICHAEAN PUNCTUATION10B80..10B91; PVALID     # PSALTER PAHLAVI LETTER ALEPH..PSALTER PAHLAVI10B99..10B9C; DISALLOWED # PSALTER PAHLAVI SECTION MARK..PSALTER PAHLAVI10BA9..10BAF; DISALLOWED # PSALTER PAHLAVI NUMBER ONE..PSALTER PAHLAVI N1107F       ; PVALID     # BRAHMI NUMBER JOINER110D0..110E8; PVALID     # SORA SOMPENG LETTER SAH..SORA SOMPENG LETTER110F0..110F9; PVALID     # SORA SOMPENG DIGIT ZERO..SORA SOMPENG DIGIT N11100..11134; PVALID     # CHAKMA SIGN CANDRABINDU..CHAKMA MAAYYAA11136..1113F; PVALID     # CHAKMA DIGIT ZERO..CHAKMA DIGIT NINE11140..11143; DISALLOWED # CHAKMA SECTION MARK..CHAKMA QUESTION MARK11150..11173; PVALID     # MAHAJANI LETTER A..MAHAJANI SIGN NUKTA11174..11175; DISALLOWED # MAHAJANI ABBREVIATION SIGN..MAHAJANI SECTION11176       ; PVALID     # MAHAJANI LIGATURE SHRI11180..111C4; PVALID     # SHARADA SIGN CANDRABINDU..SHARADA OM111C5..111C8; DISALLOWED # SHARADA DANDA..SHARADA SEPARATOR111CD       ; DISALLOWED # SHARADA SUTRA MARK111D0..111DA; PVALID     # SHARADA DIGIT ZERO..SHARADA EKAM111E1..111F4; DISALLOWED # SINHALA ARCHAIC DIGIT ONE..SINHALA ARCHAIC NU11200..11211; PVALID     # KHOJKI LETTER A..KHOJKI LETTER JJA11213..11237; PVALID     # KHOJKI LETTER NYA..KHOJKI SIGN SHADDA11238..1123D; DISALLOWED # KHOJKI DANDA..KHOJKI ABBREVIATION SIGN112B0..112EA; PVALID     # KHUDAWADI LETTER A..KHUDAWADI SIGN VIRAMA112F0..112F9; PVALID     # KHUDAWADI DIGIT ZERO..KHUDAWADI DIGIT NINE11301..11303; PVALID     # GRANTHA SIGN CANDRABINDU..GRANTHA SIGN VISARG11305..1130C; PVALID     # GRANTHA LETTER A..GRANTHA LETTER VOCALIC L1130F..11310; PVALID     # GRANTHA LETTER EE..GRANTHA LETTER AI11313..11328; PVALID     # GRANTHA LETTER OO..GRANTHA LETTER NA1132A..11330; PVALID     # GRANTHA LETTER PA..GRANTHA LETTER RA11332..11333; PVALID     # GRANTHA LETTER LA..GRANTHA LETTER LLA11335..11339; PVALID     # GRANTHA LETTER VA..GRANTHA LETTER HA1133C..11344; PVALID     # GRANTHA SIGN NUKTA..GRANTHA VOWEL SIGN VOCALI11347..11348; PVALID     # GRANTHA VOWEL SIGN EE..GRANTHA VOWEL SIGN AI1134B..1134D; PVALID     # GRANTHA VOWEL SIGN OO..GRANTHA SIGN VIRAMA11357       ; PVALID     # GRANTHA AU LENGTH MARK1135D..11363; PVALID     # GRANTHA SIGN PLUTA..GRANTHA VOWEL SIGN VOCALI11366..1136C; PVALID     # COMBINING GRANTHA DIGIT ZERO..COMBINING GRANT11370..11374; PVALID     # COMBINING GRANTHA LETTER A..COMBINING GRANTHA11480..114C5; PVALID     # TIRHUTA ANJI..TIRHUTA GVANG114C6       ; DISALLOWED # TIRHUTA ABBREVIATION SIGN114C7       ; PVALID     # TIRHUTA OM114D0..114D9; PVALID     # TIRHUTA DIGIT ZERO..TIRHUTA DIGIT NINE11580..115B5; PVALID     # SIDDHAM LETTER A..SIDDHAM VOWEL SIGN VOCALIC115B8..115C0; PVALID     # SIDDHAM VOWEL SIGN E..SIDDHAM SIGN NUKTA115C1..115C9; DISALLOWED # SIDDHAM SIGN SIDDHAM..SIDDHAM END OF TEXT MAR11600..11640; PVALID     # MODI LETTER A..MODI SIGN ARDHACANDRA11641..11643; DISALLOWED # MODI DANDA..MODI ABBREVIATION SIGN11644       ; PVALID     # MODI SIGN HUVA11650..11659; PVALID     # MODI DIGIT ZERO..MODI DIGIT NINE11680..116B7; PVALID     # TAKRI LETTER A..TAKRI SIGN NUKTA116C0..116C9; PVALID     # TAKRI DIGIT ZERO..TAKRI DIGIT NINE118A0..118BF; DISALLOWED # WARANG CITI CAPITAL LETTER NGAA..WARANG CITI118C0..118E9; PVALID     # WARANG CITI SMALL LETTER NGAA..WARANG CITI DI118EA..118F2; DISALLOWED # WARANG CITI NUMBER TEN..WARANG CITI NUMBER NI118FF       ; PVALID     # WARANG CITI OM11AC0..11AF8; PVALID     # PAU CIN HAU LETTER PA..PAU CIN HAU GLOTTAL ST1236F..12398; PVALID     # CUNEIFORM SIGN KAP ELAMITE..CUNEIFORM SIGN UM12463..1246E; DISALLOWED # CUNEIFORM NUMERIC SIGN ONE QUARTER GUR..CUNEI12474       ; DISALLOWED # CUNEIFORM PUNCTUATION SIGN DIAGONAL QUADCOLON16A40..16A5E; PVALID     # MRO LETTER TA..MRO LETTER TEK16A60..16A69; PVALID     # MRO DIGIT ZERO..MRO DIGIT NINE16A6E..16A6F; DISALLOWED # MRO DANDA..MRO DOUBLE DANDA16AD0..16AED; PVALID     # BASSA VAH LETTER ENNI..BASSA VAH LETTER I16AF0..16AF4; PVALID     # BASSA VAH COMBINING HIGH TONE..BASSA VAH COMB16AF5       ; DISALLOWED # BASSA VAH FULL STOP16B00..16B36; PVALID     # PAHAWH HMONG VOWEL KEEB..PAHAWH HMONG MARK CI16B37..16B3F; DISALLOWED # PAHAWH HMONG SIGN VOS THOM..PAHAWH HMONG SIGN16B40..16B43; PVALID     # PAHAWH HMONG SIGN VOS SEEV..PAHAWH HMONG SIGN16B44..16B45; DISALLOWED # PAHAWH HMONG SIGN XAUS..PAHAWH HMONG SIGN CIM16B50..16B59; PVALID     # PAHAWH HMONG DIGIT ZERO..PAHAWH HMONG DIGIT N16B5B..16B61; DISALLOWED # PAHAWH HMONG NUMBER TENS..PAHAWH HMONG NUMBER16B63..16B77; PVALID     # PAHAWH HMONG SIGN VOS LUB..PAHAWH HMONG SIGN16B7D..16B8F; PVALID     # PAHAWH HMONG CLAN SIGN TSHEEJ..PAHAWH HMONG C16F00..16F44; PVALID     # MIAO LETTER PA..MIAO LETTER HHA16F50..16F7E; PVALID     # MIAO LETTER NASALIZATION..MIAO VOWEL SIGN NG16F8F..16F9F; PVALID     # MIAO TONE RIGHT..MIAO LETTER REFORMED TONE-81BC00..1BC6A; PVALID     # DUPLOYAN LETTER H..DUPLOYAN LETTER VOCALIC M1BC70..1BC7C; PVALID     # DUPLOYAN AFFIX LEFT HORIZONTAL SECANT..DUPLOY1BC80..1BC88; PVALID     # DUPLOYAN AFFIX HIGH ACUTE..DUPLOYAN AFFIX HIG1BC90..1BC99; PVALID     # DUPLOYAN AFFIX LOW ACUTE..DUPLOYAN AFFIX LOW1BC9C       ; DISALLOWED # DUPLOYAN SIGN O WITH CROSS1BC9D..1BC9E; PVALID     # DUPLOYAN THICK LETTER SELECTOR..DUPLOYAN DOUB1BC9F..1BCA3; DISALLOWED # DUPLOYAN PUNCTUATION CHINOOK FULL STOP..SHORT1E800..1E8C4; PVALID     # MENDE KIKAKUI SYLLABLE M001 KI..MENDE KIKAKUI1E8C7..1E8CF; DISALLOWED # MENDE KIKAKUI DIGIT ONE..MENDE KIKAKUI DIGIT1E8D0..1E8D6; PVALID     # MENDE KIKAKUI COMBINING NUMBER TEENS..MENDE K1EE00..1EE03; DISALLOWED # ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL1EE05..1EE1F; DISALLOWED # ARABIC MATHEMATICAL WAW..ARABIC MATHEMATICAL1EE21..1EE22; DISALLOWED # ARABIC MATHEMATICAL INITIAL BEH..ARABIC MATHE1EE24       ; DISALLOWED # ARABIC MATHEMATICAL INITIAL HEH1EE27       ; DISALLOWED # ARABIC MATHEMATICAL INITIAL HAH1EE29..1EE32; DISALLOWED # ARABIC MATHEMATICAL INITIAL YEH..ARABIC MATHE1EE34..1EE37; DISALLOWED # ARABIC MATHEMATICAL INITIAL SHEEN..ARABIC MAT1EE39       ; DISALLOWED # ARABIC MATHEMATICAL INITIAL DAD1EE3B       ; DISALLOWED # ARABIC MATHEMATICAL INITIAL GHAIN1EE42       ; DISALLOWED # ARABIC MATHEMATICAL TAILED JEEM1EE47       ; DISALLOWED # ARABIC MATHEMATICAL TAILED HAH1EE49       ; DISALLOWED # ARABIC MATHEMATICAL TAILED YEH1EE4B       ; DISALLOWED # ARABIC MATHEMATICAL TAILED LAM1EE4D..1EE4F; DISALLOWED # ARABIC MATHEMATICAL TAILED NOON..ARABIC MATHE1EE51..1EE52; DISALLOWED # ARABIC MATHEMATICAL TAILED SAD..ARABIC MATHEM1EE54       ; DISALLOWED # ARABIC MATHEMATICAL TAILED SHEEN1EE57       ; DISALLOWED # ARABIC MATHEMATICAL TAILED KHAH1EE59       ; DISALLOWED # ARABIC MATHEMATICAL TAILED DAD1EE5B       ; DISALLOWED # ARABIC MATHEMATICAL TAILED GHAIN1EE5D       ; DISALLOWED # ARABIC MATHEMATICAL TAILED DOTLESS NOON1EE5F       ; DISALLOWED # ARABIC MATHEMATICAL TAILED DOTLESS QAF1EE61..1EE62; DISALLOWED # ARABIC MATHEMATICAL STRETCHED BEH..ARABIC MAT1EE64       ; DISALLOWED # ARABIC MATHEMATICAL STRETCHED HEH1EE67..1EE6A; DISALLOWED # ARABIC MATHEMATICAL STRETCHED HAH..ARABIC MAT1EE6C..1EE72; DISALLOWED # ARABIC MATHEMATICAL STRETCHED MEEM..ARABIC MA1EE74..1EE77; DISALLOWED # ARABIC MATHEMATICAL STRETCHED SHEEN..ARABIC M1EE79..1EE7C; DISALLOWED # ARABIC MATHEMATICAL STRETCHED DAD..ARABIC MAT1EE7E       ; DISALLOWED # ARABIC MATHEMATICAL STRETCHED DOTLESS FEH1EE80..1EE89; DISALLOWED # ARABIC MATHEMATICAL LOOPED ALEF..ARABIC MATHE1EE8B..1EE9B; DISALLOWED # ARABIC MATHEMATICAL LOOPED LAM..ARABIC MATHEM1EEA1..1EEA3; DISALLOWED # ARABIC MATHEMATICAL DOUBLE-STRUCK BEH..ARABIC1EEA5..1EEA9; DISALLOWED # ARABIC MATHEMATICAL DOUBLE-STRUCK WAW..ARABIC1EEAB..1EEBB; DISALLOWED # ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC1EEF0..1EEF1; DISALLOWED # ARABIC MATHEMATICAL OPERATOR MEEM WITH HAH WI1F0BF       ; DISALLOWED # PLAYING CARD RED JOKER1F0E0..1F0F5; DISALLOWED # PLAYING CARD FOOL..PLAYING CARD TRUMP-211F10B..1F10C; DISALLOWED # DINGBAT CIRCLED SANS-SERIF DIGIT ZERO..DINGBA1F16A..1F16B; DISALLOWED # RAISED MC SIGN..RAISED MD SIGN1F321..1F32C; DISALLOWED # THERMOMETER..WIND BLOWING FACE1F336       ; DISALLOWED # HOT PEPPER1F37D       ; DISALLOWED # FORK AND KNIFE WITH PLATE1F394..1F39F; DISALLOWED # HEART WITH TIP ON THE LEFT..ADMISSION TICKETS1F3C5       ; DISALLOWED # SPORTS MEDAL1F3CB..1F3CE; DISALLOWED # WEIGHT LIFTER..RACING CAR1F3D4..1F3DF; DISALLOWED # SNOW CAPPED MOUNTAIN..STADIUM1F3F1..1F3F7; DISALLOWED # WHITE PENNANT..LABEL1F43F       ; DISALLOWED # CHIPMUNK1F441       ; DISALLOWED # EYE1F4F8       ; DISALLOWED # CAMERA WITH FLASH1F4FD..1F4FE; DISALLOWED # FILM PROJECTOR..PORTABLE STEREO1F53E..1F54A; DISALLOWED # LOWER RIGHT SHADOWED WHITE CIRCLE..DOVE OF PE1F568..1F579; DISALLOWED # RIGHT SPEAKER..JOYSTICK1F57B..1F5A3; DISALLOWED # LEFT HAND TELEPHONE RECEIVER..BLACK DOWN POIN1F5A5..1F5FA; DISALLOWED # DESKTOP COMPUTER..WORLD MAP1F600       ; DISALLOWED # GRINNING FACE1F611       ; DISALLOWED # EXPRESSIONLESS FACE1F615       ; DISALLOWED # CONFUSED FACE1F617       ; DISALLOWED # KISSING FACE1F619       ; DISALLOWED # KISSING FACE WITH SMILING EYES1F61B       ; DISALLOWED # FACE WITH STUCK-OUT TONGUE1F61F       ; DISALLOWED # WORRIED FACE1F626..1F627; DISALLOWED # FROWNING FACE WITH OPEN MOUTH..ANGUISHED FACE1F62C       ; DISALLOWED # GRIMACING FACE1F62E..1F62F; DISALLOWED # FACE WITH OPEN MOUTH..HUSHED FACE1F634       ; DISALLOWED # SLEEPING FACE1F641..1F642; DISALLOWED # SLIGHTLY FROWNING FACE..SLIGHTLY SMILING FACE1F650..1F67F; DISALLOWED # NORTH WEST POINTING LEAF..REVERSE CHECKER BOA1F6C6..1F6CF; DISALLOWED # TRIANGLE WITH ROUNDED CORNERS..BED1F6E0..1F6EC; DISALLOWED # HAMMER AND WRENCH..AIRPLANE ARRIVING1F6F0..1F6F3; DISALLOWED # SATELLITE..PASSENGER SHIP1F780..1F7D4; DISALLOWED # BLACK LEFT-POINTING ISOSCELES RIGHT TRIANGLE.1F800..1F80B; DISALLOWED # LEFTWARDS ARROW WITH SMALL TRIANGLE ARROWHEAD1F810..1F847; DISALLOWED # LEFTWARDS ARROW WITH SMALL EQUILATERAL ARROWH1F850..1F859; DISALLOWED # LEFTWARDS SANS-SERIF ARROW..UP DOWN SANS-SERI1F860..1F887; DISALLOWED # WIDE-HEADED LEFTWARDS LIGHT BARB ARROW..WIDE-1F890..1F8AD; DISALLOWED # LEFTWARDS TRIANGLE ARROWHEAD..WHITE ARROW SHA

Appendix B.Changes from Unicode 7.0.0 to Unicode 8.0.0

Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED.

08B3..08B4  ; PVALID     # ARABIC LETTER AIN WITH THREE DOTS BELOW..ARAB08E3        ; PVALID     # ARABIC TURNED DAMMA BELOW0AF9        ; PVALID     # GUJARATI LETTER ZHA0C5A        ; PVALID     # TELUGU LETTER RRRA0D5F        ; PVALID     # MALAYALAM LETTER ARCHAIC II13F5        ; PVALID     # CHEROKEE LETTER MV13F8..13FD  ; DISALLOWED # CHEROKEE SMALL LETTER YE..CHEROKEE SMALL LETT20BE        ; DISALLOWED # LARI SIGN218A..218B  ; DISALLOWED # TURNED DIGIT TWO..TURNED DIGIT THREE2BEC..2BEF  ; DISALLOWED # LEFTWARDS TWO-HEADED ARROW WITH TRIANGLE ARRO9FCD..9FD5  ; PVALID     # <CJK Ideograph>..<CJK Ideograph>A69E        ; PVALID     # COMBINING CYRILLIC LETTER EFA78F        ; PVALID     # LATIN LETTER SINOLOGICAL DOTA7B2..A7B4  ; DISALLOWED # LATIN CAPITAL LETTER J WITH CROSSED-TAIL..LATA7B5        ; PVALID     # LATIN SMALL LETTER BETAA7B6        ; DISALLOWED # LATIN CAPITAL LETTER OMEGAA7B7        ; PVALID     # LATIN SMALL LETTER OMEGAA8FC        ; DISALLOWED # DEVANAGARI SIGN SIDDHAMA8FD        ; PVALID     # DEVANAGARI JAIN OMAB60..AB63  ; PVALID     # LATIN SMALL LETTER SAKHA YAT..LATIN SMALL LETAB70..ABBF  ; DISALLOWED # CHEROKEE SMALL LETTER A..CHEROKEE SMALL LETTEFE2E..FE2F  ; PVALID     # COMBINING CYRILLIC TITLO LEFT HALF..COMBINING108E0..108F2; PVALID     # HATRAN LETTER ALEPH..HATRAN LETTER QOPH108F4..108F5; PVALID     # HATRAN LETTER SHIN..HATRAN LETTER TAW108FB..108FF; DISALLOWED # HATRAN NUMBER ONE..HATRAN NUMBER ONE HUNDRED109BC..109BD; DISALLOWED # MEROITIC CURSIVE FRACTION ELEVEN TWELFTHS..ME109C0..109CF; DISALLOWED # MEROITIC CURSIVE NUMBER ONE..MEROITIC CURSIVE109D2..109FF; DISALLOWED # MEROITIC CURSIVE NUMBER ONE HUNDRED..MEROITIC10C80..10CB2; DISALLOWED # OLD HUNGARIAN CAPITAL LETTER A..OLD HUNGARIAN10CC0..10CF2; PVALID     # OLD HUNGARIAN SMALL LETTER A..OLD HUNGARIAN S10CFA..10CFF; DISALLOWED # OLD HUNGARIAN NUMBER ONE..OLD HUNGARIAN NUMBE111C9       ; DISALLOWED # SHARADA SANDHI MARK111CA..111CC; PVALID     # SHARADA SIGN NUKTA..SHARADA EXTRA SHORT VOWEL111DB       ; DISALLOWED # SHARADA SIGN SIDDHAM111DC       ; PVALID     # SHARADA HEADSTROKE111DD..111DF; DISALLOWED # SHARADA CONTINUATION SIGN..SHARADA SECTION MA11280..11286; PVALID     # MULTANI LETTER A..MULTANI LETTER GA11288       ; PVALID     # MULTANI LETTER GHA1128A..1128D; PVALID     # MULTANI LETTER CA..MULTANI LETTER JJA1128F..1129D; PVALID     # MULTANI LETTER NYA..MULTANI LETTER BA1129F..112A8; PVALID     # MULTANI LETTER BHA..MULTANI LETTER RHA112A9       ; DISALLOWED # MULTANI SECTION MARK11300       ; PVALID     # GRANTHA SIGN COMBINING ANUSVARA ABOVE11350       ; PVALID     # GRANTHA OM115CA..115D7; DISALLOWED # SIDDHAM SECTION MARK WITH TRIDENT AND U-SHAPE115D8..115DD; PVALID     # SIDDHAM LETTER THREE-CIRCLE ALTERNATE I..SIDD11700..11719; PVALID     # AHOM LETTER KA..AHOM LETTER JHA1171D..1172B; PVALID     # AHOM CONSONANT SIGN MEDIAL LA..AHOM SIGN KILL11730..11739; PVALID     # AHOM DIGIT ZERO..AHOM DIGIT NINE1173A..1173F; DISALLOWED # AHOM NUMBER TEN..AHOM SYMBOL VI12399       ; PVALID     # CUNEIFORM SIGN U U12480..12543; PVALID     # CUNEIFORM SIGN AB TIMES NUN TENU..CUNEIFORM S14400..14646; PVALID     # ANATOLIAN HIEROGLYPH A001..ANATOLIAN HIEROGLY1D1DE..1D1E8; DISALLOWED # MUSICAL SYMBOL KIEVAN C CLEF..MUSICAL SYMBOL1D800..1D9FF; DISALLOWED # SIGNWRITING HAND-FIST INDEX..SIGNWRITING HEAD1DA00..1DA36; PVALID     # SIGNWRITING HEAD RIM..SIGNWRITING AIR SUCKING1DA37..1DA3A; DISALLOWED # SIGNWRITING AIR BLOW SMALL ROTATIONS..SIGNWRI1DA3B..1DA6C; PVALID     # SIGNWRITING MOUTH CLOSED NEUTRAL..SIGNWRITING1DA6D..1DA74; DISALLOWED # SIGNWRITING SHOULDER HIP SPINE..SIGNWRITING T1DA75       ; PVALID     # SIGNWRITING UPPER BODY TILTING FROM HIP JOINT1DA76..1DA83; DISALLOWED # SIGNWRITING LIMB COMBINATION..SIGNWRITING LOC1DA84       ; PVALID     # SIGNWRITING LOCATION HEAD NECK1DA85..1DA8B; DISALLOWED # SIGNWRITING LOCATION TORSO..SIGNWRITING PAREN1DA9B..1DA9F; PVALID     # SIGNWRITING FILL MODIFIER-2..SIGNWRITING FILL1DAA1..1DAAF; PVALID     # SIGNWRITING ROTATION MODIFIER-2..SIGNWRITING1F32D..1F32F; DISALLOWED # HOT DOG..BURRITO1F37E..1F37F; DISALLOWED # BOTTLE WITH POPPING CORK..POPCORN1F3CF..1F3D3; DISALLOWED # CRICKET BAT AND BALL..TABLE TENNIS PADDLE AND1F3F8..1F3FF; DISALLOWED # BADMINTON RACQUET AND SHUTTLECOCK..EMOJI MODI1F4FF       ; DISALLOWED # PRAYER BEADS1F54B..1F54F; DISALLOWED # KAABA..BOWL OF HYGIEIA1F643..1F644; DISALLOWED # UPSIDE-DOWN FACE..FACE WITH ROLLING EYES1F6D0       ; DISALLOWED # PLACE OF WORSHIP1F910..1F918; DISALLOWED # ZIPPER-MOUTH FACE..SIGN OF THE HORNS1F980..1F984; DISALLOWED # CRAB..UNICORN FACE1F9C0       ; DISALLOWED # CHEESE WEDGE2B820..2CEA1; PVALID     # <CJK Ideograph Extension E>..<CJK Ideograph E

Appendix C.Changes from Unicode 8.0.0 to Unicode 9.0.0

Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED.

08B6..08BD  ; PVALID     # ARABIC LETTER BEH WITH SMALL MEEM ABOVE..ARAB08D4..08E1  ; PVALID     # ARABIC SMALL HIGH WORD AR-RUB..ARABIC SMALL H08E2        ; DISALLOWED # ARABIC DISPUTED END OF AYAH0C80        ; PVALID     # KANNADA SIGN SPACING CANDRABINDU0D4F        ; DISALLOWED # MALAYALAM SIGN PARA0D54..0D56  ; PVALID     # MALAYALAM LETTER CHILLU M..MALAYALAM LETTER C0D58..0D5E  ; DISALLOWED # MALAYALAM FRACTION ONE ONE-HUNDRED-AND-SIXTIE0D76..0D78  ; DISALLOWED # MALAYALAM FRACTION ONE SIXTEENTH..MALAYALAM F1C80..1C88  ; DISALLOWED # CYRILLIC SMALL LETTER ROUNDED VE..CYRILLIC SM1DFB        ; PVALID     # COMBINING DELETION MARK23FB..23FE  ; DISALLOWED # POWER SYMBOL..POWER SLEEP SYMBOL2E43..2E44  ; DISALLOWED # DASH WITH LEFT UPTURN..DOUBLE SUSPENSION MARKA7AE        ; DISALLOWED # LATIN CAPITAL LETTER SMALL CAPITAL IA8C5        ; PVALID     # SAURASHTRA SIGN CANDRABINDU1018D..1018E; DISALLOWED # GREEK INDICTION SIGN..NOMISMA SIGN104B0..104D3; DISALLOWED # OSAGE CAPITAL LETTER A..OSAGE CAPITAL LETTER104D8..104FB; PVALID     # OSAGE SMALL LETTER A..OSAGE SMALL LETTER ZHA1123E       ; PVALID     # KHOJKI SIGN SUKUN11400..1144A; PVALID     # NEWA LETTER A..NEWA SIDDHI1144B..1144F; DISALLOWED # NEWA DANDA..NEWA ABBREVIATION SIGN11450..11459; PVALID     # NEWA DIGIT ZERO..NEWA DIGIT NINE1145B       ; DISALLOWED # NEWA PLACEHOLDER MARK1145D       ; DISALLOWED # NEWA INSERTION SIGN11660..1166C; DISALLOWED # MONGOLIAN BIRGA WITH ORNAMENT..MONGOLIAN TURN11C00..11C08; PVALID     # BHAIKSUKI LETTER A..BHAIKSUKI LETTER VOCALIC11C0A..11C36; PVALID     # BHAIKSUKI LETTER E..BHAIKSUKI VOWEL SIGN VOCA11C38..11C40; PVALID     # BHAIKSUKI VOWEL SIGN E..BHAIKSUKI SIGN AVAGRA11C41..11C45; DISALLOWED # BHAIKSUKI DANDA..BHAIKSUKI GAP FILLER-211C50..11C59; PVALID     # BHAIKSUKI DIGIT ZERO..BHAIKSUKI DIGIT NINE11C5A..11C6C; DISALLOWED # BHAIKSUKI NUMBER ONE..BHAIKSUKI HUNDREDS UNIT11C70..11C71; DISALLOWED # MARCHEN HEAD MARK..MARCHEN MARK SHAD11C72..11C8F; PVALID     # MARCHEN LETTER KA..MARCHEN LETTER A11C92..11CA7; PVALID     # MARCHEN SUBJOINED LETTER KA..MARCHEN SUBJOINE11CA9..11CB6; PVALID     # MARCHEN SUBJOINED LETTER YA..MARCHEN SIGN CAN16FE0       ; PVALID     # TANGUT ITERATION MARK17000..187EC; PVALID     # <Tangut Ideograph>..<Tangut Ideograph>18800..18AF2; PVALID     # TANGUT COMPONENT-001..TANGUT COMPONENT-7551E000..1E006; PVALID     # COMBINING GLAGOLITIC LETTER AZU..COMBINING GL1E008..1E018; PVALID     # COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING1E01B..1E021; PVALID     # COMBINING GLAGOLITIC LETTER SHTA..COMBINING G1E023..1E024; PVALID     # COMBINING GLAGOLITIC LETTER YU..COMBINING GLA1E026..1E02A; PVALID     # COMBINING GLAGOLITIC LETTER YO..COMBINING GLA1E900..1E921; DISALLOWED # ADLAM CAPITAL LETTER ALIF..ADLAM CAPITAL LETT1E922..1E94A; PVALID     # ADLAM SMALL LETTER ALIF..ADLAM NUKTA1E950..1E959; PVALID     # ADLAM DIGIT ZERO..ADLAM DIGIT NINE1E95E..1E95F; DISALLOWED # ADLAM INITIAL EXCLAMATION MARK..ADLAM INITIAL1F19B..1F1AC; DISALLOWED # SQUARED THREE D..SQUARED VOD1F23B       ; DISALLOWED # SQUARED CJK UNIFIED IDEOGRAPH-914D1F57A       ; DISALLOWED # MAN DANCING1F5A4       ; DISALLOWED # BLACK HEART1F6D1..1F6D2; DISALLOWED # OCTAGONAL SIGN..SHOPPING TROLLEY1F6F4..1F6F6; DISALLOWED # SCOOTER..CANOE1F919..1F91E; DISALLOWED # CALL ME HAND..HAND WITH INDEX AND MIDDLE FING1F920..1F927; DISALLOWED # FACE WITH COWBOY HAT..SNEEZING FACE1F930       ; DISALLOWED # PREGNANT WOMAN1F933..1F93E; DISALLOWED # SELFIE..HANDBALL1F940..1F94B; DISALLOWED # WILTED FLOWER..MARTIAL ARTS UNIFORM1F950..1F95E; DISALLOWED # CROISSANT..PANCAKES1F985..1F991; DISALLOWED # EAGLE..SQUID

Appendix D.Changes from Unicode 9.0.0 to Unicode 10.0.0

Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED.

0860..086A  ; PVALID     # SYRIAC LETTER MALAYALAM NGA..SYRIAC LETTER MA09FC        ; PVALID     # BENGALI LETTER VEDIC ANUSVARA09FD        ; DISALLOWED # BENGALI ABBREVIATION SIGN0AFA..0AFF  ; PVALID     # GUJARATI SIGN SUKUN..GUJARATI SIGN TWO-CIRCLE0D00        ; PVALID     # MALAYALAM SIGN COMBINING ANUSVARA ABOVE0D3B..0D3C  ; PVALID     # MALAYALAM SIGN VERTICAL BAR VIRAMA..MALAYALAM1CF7        ; PVALID     # VEDIC SIGN ATIKRAMA1DF6..1DF9  ; PVALID     # COMBINING KAVYKA ABOVE RIGHT..COMBINING WIDE20BF        ; DISALLOWED # BITCOIN SIGN23FF        ; DISALLOWED # OBSERVER EYE SYMBOL2BD2        ; DISALLOWED # GROUP MARK2E45..2E49  ; DISALLOWED # INVERTED LOW KAVYKA..DOUBLE STACKED COMMA312E        ; PVALID     # BOPOMOFO LETTER O WITH DOT ABOVE9FD6..9FEA  ; PVALID     # <CJK Ideograph>..<CJK Ideograph>1032D..1032F; PVALID     # OLD ITALIC LETTER YE..OLD ITALIC LETTER SOUTH11A00..11A3E; PVALID     # ZANABAZAR SQUARE LETTER A..ZANABAZAR SQUARE C11A3F..11A46; DISALLOWED # ZANABAZAR SQUARE INITIAL HEAD MARK..ZANABAZAR11A47       ; PVALID     # ZANABAZAR SQUARE SUBJOINER11A50..11A83; PVALID     # SOYOMBO LETTER A..SOYOMBO LETTER KSSA11A86..11A99; PVALID     # SOYOMBO CLUSTER-INITIAL LETTER RA..SOYOMBO SU11A9A..11A9C; DISALLOWED # SOYOMBO MARK TSHEG..SOYOMBO MARK DOUBLE SHAD11A9E..11AA2; DISALLOWED # SOYOMBO HEAD MARK WITH MOON AND SUN AND TRIPL11D00..11D06; PVALID     # MASARAM GONDI LETTER A..MASARAM GONDI LETTER11D08..11D09; PVALID     # MASARAM GONDI LETTER AI..MASARAM GONDI LETTER11D0B..11D36; PVALID     # MASARAM GONDI LETTER AU..MASARAM GONDI VOWEL11D3A       ; PVALID     # MASARAM GONDI VOWEL SIGN E11D3C..11D3D; PVALID     # MASARAM GONDI VOWEL SIGN AI..MASARAM GONDI VO11D3F..11D47; PVALID     # MASARAM GONDI VOWEL SIGN AU..MASARAM GONDI RA11D50..11D59; PVALID     # MASARAM GONDI DIGIT ZERO..MASARAM GONDI DIGIT16FE1       ; PVALID     # NUSHU ITERATION MARK1B002..1B11E; PVALID     # HENTAIGANA LETTER A-1..HENTAIGANA LETTER N-MU1B170..1B2FB; PVALID     # NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB1F260..1F265; DISALLOWED # ROUNDED SYMBOL FOR FU..ROUNDED SYMBOL FOR CAI1F6D3..1F6D4; DISALLOWED # STUPA..PAGODA1F6F7..1F6F8; DISALLOWED # SLED..FLYING SAUCER1F900..1F90B; DISALLOWED # CIRCLED CROSS FORMEE WITH FOUR DOTS..DOWNWARD1F91F       ; DISALLOWED # I LOVE YOU HAND SIGN1F928..1F92F; DISALLOWED # FACE WITH ONE EYEBROW RAISED..SHOCKED FACE WI1F931..1F932; DISALLOWED # BREAST-FEEDING..PALMS UP TOGETHER1F94C       ; DISALLOWED # CURLING STONE1F95F..1F96B; DISALLOWED # DUMPLING..CANNED FOOD1F992..1F997; DISALLOWED # GIRAFFE FACE..CRICKET1F9D0..1F9E6; DISALLOWED # FACE WITH MONOCLE..SOCKS2CEB0..2EBE0; PVALID     # <CJK Ideograph Extension F>..<CJK Ideograph E

Appendix E.Changes from Unicode 10.0.0 to Unicode 11.0.0

Changes from derived property value DISALLOWED to PVALID.

111C9       ; PVALID      # SHARADA SANDHI MARK

Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED.

0560        ; PVALID     # ARMENIAN SMALL LETTER TURNED AYB0588        ; PVALID     # ARMENIAN SMALL LETTER YI WITH STROKE05EF        ; PVALID     # HEBREW YOD TRIANGLE07FD        ; PVALID     # NKO DANTAYALAN07FE..07FF  ; DISALLOWED # NKO DOROME SIGN..NKO TAMAN SIGN08D3        ; PVALID     # ARABIC SMALL LOW WAW09FE        ; PVALID     # BENGALI SANDHI MARK0A76        ; DISALLOWED # GURMUKHI ABBREVIATION SIGN0C04        ; PVALID     # TELUGU SIGN COMBINING ANUSVARA ABOVE0C84        ; DISALLOWED # KANNADA SIGN SIDDHAM1878        ; PVALID     # MONGOLIAN LETTER CHA WITH TWO DOTS1C90..1CBA  ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AN..GEORGIAN1CBD..1CBF  ; DISALLOWED # GEORGIAN MTAVRULI CAPITAL LETTER AEN..GEORGIA2BBA..2BBC  ; DISALLOWED # OVERLAPPING WHITE SQUARES..OVERLAPPING BLACK2BD3..2BEB  ; DISALLOWED # PLUTO FORM TWO..STAR WITH RIGHT HALF BLACK2BF0..2BFE  ; DISALLOWED # ERIS FORM ONE..REVERSED RIGHT ANGLE2E4A..2E4E  ; DISALLOWED # DOTTED SOLIDUS..PUNCTUS ELEVATUS MARK312F        ; PVALID     # BOPOMOFO LETTER NN9FEB..9FEF  ; PVALID     # <CJK Ideograph>..<CJK Ideograph>A7AF        ; PVALID     # LATIN LETTER SMALL CAPITAL QA7B8        ; DISALLOWED # LATIN CAPITAL LETTER U WITH STROKEA7B9        ; PVALID     # LATIN SMALL LETTER U WITH STROKEA8FE..A8FF  ; PVALID     # DEVANAGARI LETTER AY..DEVANAGARI VOWEL SIGN A10A34..10A35; PVALID     # KHAROSHTHI LETTER TTTA..KHAROSHTHI LETTER VHA10A48       ; DISALLOWED # KHAROSHTHI FRACTION ONE HALF10D00..10D27; PVALID     # HANIFI ROHINGYA LETTER A..HANIFI ROHINGYA SIG10D30..10D39; PVALID     # HANIFI ROHINGYA DIGIT ZERO..HANIFI ROHINGYA D10F00..10F1C; PVALID     # OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER10F1D..10F26; DISALLOWED # OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION10F27       ; PVALID     # OLD SOGDIAN LIGATURE AYIN-DALETH10F30..10F50; PVALID     # SOGDIAN LETTER ALEPH..SOGDIAN COMBINING STROK10F51..10F59; DISALLOWED # SOGDIAN NUMBER ONE..SOGDIAN PUNCTUATION HALF110CD       ; DISALLOWED # KAITHI NUMBER SIGN ABOVE11144..11146; PVALID     # CHAKMA LETTER LHAA..CHAKMA VOWEL SIGN EI1133B       ; PVALID     # COMBINING BINDU BELOW1145E       ; PVALID     # NEWA SANDHI MARK1171A       ; PVALID     # AHOM LETTER ALTERNATE BA11800..1183A; PVALID     # DOGRA LETTER A..DOGRA SIGN NUKTA1183B       ; DISALLOWED # DOGRA ABBREVIATION SIGN11A9D       ; PVALID     # SOYOMBO MARK PLUTA11D60..11D65; PVALID     # GUNJALA GONDI LETTER A..GUNJALA GONDI LETTER11D67..11D68; PVALID     # GUNJALA GONDI LETTER EE..GUNJALA GONDI LETTER11D6A..11D8E; PVALID     # GUNJALA GONDI LETTER OO..GUNJALA GONDI VOWEL11D90..11D91; PVALID     # GUNJALA GONDI VOWEL SIGN EE..GUNJALA GONDI VO11D93..11D98; PVALID     # GUNJALA GONDI VOWEL SIGN OO..GUNJALA GONDI OM11DA0..11DA9; PVALID     # GUNJALA GONDI DIGIT ZERO..GUNJALA GONDI DIGIT11EE0..11EF6; PVALID     # MAKASAR LETTER KA..MAKASAR VOWEL SIGN O11EF7..11EF8; DISALLOWED # MAKASAR PASSIMBANG..MAKASAR END OF SECTION16E40..16E5F; DISALLOWED # MEDEFAIDRIN CAPITAL LETTER M..MEDEFAIDRIN CAP16E60..16E7F; PVALID     # MEDEFAIDRIN SMALL LETTER M..MEDEFAIDRIN SMALL16E80..16E9A; DISALLOWED # MEDEFAIDRIN DIGIT ZERO..MEDEFAIDRIN EXCLAMATI187ED..187F1; PVALID     # <Tangut Ideograph>..<Tangut Ideograph>1D2E0..1D2F3; DISALLOWED # MAYAN NUMERAL ZERO..MAYAN NUMERAL NINETEEN1D372..1D378; DISALLOWED # IDEOGRAPHIC TALLY MARK ONE..TALLY MARK FIVE1EC71..1ECB4; DISALLOWED # INDIC SIYAQ NUMBER ONE..INDIC SIYAQ ALTERNATE1F12F       ; DISALLOWED # COPYLEFT SYMBOL1F6F9       ; DISALLOWED # SKATEBOARD1F7D5..1F7D8; DISALLOWED # CIRCLED TRIANGLE..NEGATIVE CIRCLED SQUARE1F94D..1F94F; DISALLOWED # LACROSSE STICK AND BALL..FLYING DISC1F96C..1F970; DISALLOWED # LEAFY GREEN..SMILING FACE WITH SMILING EYES A1F973..1F976; DISALLOWED # FACE WITH PARTY HORN AND PARTY HAT..FREEZING1F97A       ; DISALLOWED # FACE WITH PLEADING EYES1F97C..1F97F; DISALLOWED # LAB COAT..FLAT SHOE1F998..1F9A2; DISALLOWED # KANGAROO..SWAN1F9B0..1F9B9; DISALLOWED # EMOJI COMPONENT RED HAIR..SUPERVILLAIN1F9C1..1F9C2; DISALLOWED # CUPCAKE..SALT SHAKER1F9E7..1F9FF; DISALLOWED # RED GIFT ENVELOPE..NAZAR AMULET1FA60..1FA6D; DISALLOWED # XIANGQI RED GENERAL..XIANGQI BLACK SOLDIER

Appendix F.Changes from Unicode 11.0.0 to Unicode 12.0.0

Changes from derived property value UNASSIGNED to either PVALID or DISALLOWED.

0C77        ; DISALLOWED # TELUGU SIGN SIDDHAM0E86        ; PVALID     # LAO LETTER PALI GHA0E89        ; PVALID     # LAO LETTER PALI CHA0E8C        ; PVALID     # LAO LETTER PALI JHA0E8E..0E93  ; PVALID     # LAO LETTER PALI NYA..LAO LETTER PALI NNA0E98        ; PVALID     # LAO LETTER PALI DHA0EA0        ; PVALID     # LAO LETTER PALI BHA0EA8..0EA9  ; PVALID     # LAO LETTER SANSKRIT SHA..LAO LETTER SANSKRIT0EAC        ; PVALID     # LAO LETTER PALI LLA0EBA        ; PVALID     # LAO SIGN PALI VIRAMA1CFA        ; PVALID     # VEDIC SIGN DOUBLE ANUSVARA ANTARGOMUKHA2BC9        ; DISALLOWED # NEPTUNE FORM TWO2BFF        ; DISALLOWED # HELLSCHREIBER PAUSE SYMBOL2E4F        ; DISALLOWED # CORNISH VERSE DIVIDERA7BA        ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL AA7BB        ; PVALID     # LATIN SMALL LETTER GLOTTAL AA7BC        ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL IA7BD        ; PVALID     # LATIN SMALL LETTER GLOTTAL IA7BE        ; DISALLOWED # LATIN CAPITAL LETTER GLOTTAL UA7BF        ; PVALID     # LATIN SMALL LETTER GLOTTAL UA7C2        ; DISALLOWED # LATIN CAPITAL LETTER ANGLICANA WA7C3        ; PVALID     # LATIN SMALL LETTER ANGLICANA WA7C4..A7C6  ; DISALLOWED # LATIN CAPITAL LETTER C WITH PALATAL HOOK..LATAB66..AB67  ; PVALID     # LATIN SMALL LETTER DZ DIGRAPH WITH RETROFLEX10FE0..10FF6; PVALID     # ELYMAIC LETTER ALEPH..ELYMAIC LIGATURE ZAYIN-1145F       ; PVALID     # NEWA LETTER VEDIC ANUSVARA116B8       ; PVALID     # TAKRI LETTER ARCHAIC KHA119A0..119A7; PVALID     # NANDINAGARI LETTER A..NANDINAGARI LETTER VOCA119AA..119D7; PVALID     # NANDINAGARI LETTER E..NANDINAGARI VOWEL SIGN119DA..119E1; PVALID     # NANDINAGARI VOWEL SIGN E..NANDINAGARI SIGN AV119E2       ; DISALLOWED # NANDINAGARI SIGN SIDDHAM119E3..119E4; PVALID     # NANDINAGARI HEADSTROKE..NANDINAGARI VOWEL SIG11A84..11A85; PVALID     # SOYOMBO SIGN JIHVAMULIYA..SOYOMBO SIGN UPADHM11FC0..11FF1; DISALLOWED # TAMIL FRACTION ONE THREE-HUNDRED-AND-TWENTIET11FFF       ; DISALLOWED # TAMIL PUNCTUATION END OF TEXT13430..13438; DISALLOWED # EGYPTIAN HIEROGLYPH VERTICAL JOINER..EGYPTIAN16F45..16F4A; PVALID     # MIAO LETTER BRI..MIAO LETTER RTE16F4F       ; PVALID     # MIAO SIGN CONSONANT MODIFIER BAR16F7F..16F87; PVALID     # MIAO VOWEL SIGN UOG..MIAO VOWEL SIGN UI16FE2       ; DISALLOWED # OLD CHINESE HOOK MARK16FE3       ; PVALID     # OLD CHINESE ITERATION MARK187F2..187F7; PVALID     # <Tangut Ideograph>..<Tangut Ideograph>1B150..1B152; PVALID     # HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMA1B164..1B167; PVALID     # KATAKANA LETTER SMALL WI..KATAKANA LETTER SMA1E100..1E12C; PVALID     # NYIAKENG PUACHUE HMONG LETTER MA..NYIAKENG PU1E130..1E13D; PVALID     # NYIAKENG PUACHUE HMONG TONE-B..NYIAKENG PUACH1E140..1E149; PVALID     # NYIAKENG PUACHUE HMONG DIGIT ZERO..NYIAKENG P1E14E       ; PVALID     # NYIAKENG PUACHUE HMONG LOGOGRAM NYAJ1E14F       ; DISALLOWED # NYIAKENG PUACHUE HMONG CIRCLED CA1E2C0..1E2F9; PVALID     # WANCHO LETTER AA..WANCHO DIGIT NINE1E2FF       ; DISALLOWED # WANCHO NGUN SIGN1E94B       ; PVALID     # ADLAM NASALIZATION MARK1ED01..1ED3D; DISALLOWED # OTTOMAN SIYAQ NUMBER ONE..OTTOMAN SIYAQ FRACT1F16C       ; DISALLOWED # RAISED MR SIGN1F6D5       ; DISALLOWED # HINDU TEMPLE1F6FA       ; DISALLOWED # AUTO RICKSHAW1F7E0..1F7EB; DISALLOWED # LARGE ORANGE CIRCLE..LARGE BROWN SQUARE1F90D..1F90F; DISALLOWED # WHITE HEART..PINCHING HAND1F93F       ; DISALLOWED # DIVING MASK1F971       ; DISALLOWED # YAWNING FACE1F97B       ; DISALLOWED # SARI1F9A5..1F9AA; DISALLOWED # SLOTH..OYSTER1F9AE..1F9AF; DISALLOWED # GUIDE DOG..PROBING CANE1F9BA..1F9BF; DISALLOWED # SAFETY VEST..MECHANICAL LEG1F9C3..1F9CA; DISALLOWED # BEVERAGE BOX..ICE CUBE1F9CD..1F9CF; DISALLOWED # STANDING PERSON..DEAF PERSON1FA00..1FA53; DISALLOWED # NEUTRAL CHESS KING..BLACK CHESS KNIGHT-BISHOP1FA70..1FA73; DISALLOWED # BALLET SHOES..SHORTS1FA78..1FA7A; DISALLOWED # DROP OF BLOOD..STETHOSCOPE1FA80..1FA82; DISALLOWED # YO-YO..PARACHUTE1FA90..1FA95; DISALLOWED # RINGED PLANET..BANJO

Acknowledgments

Thanks toHarald Alvestrand,Marc Blanchet,Martin Dürst,Asmus Freytag,Ted Hardie,John Klensin,Erik Nordmark,Pete Resnick,Peter Saint-Andre,Michel Suignard,Andrew Sullivan, andSuzanne Woolf for input to this document.

Author's Address

Patrik Fältström
Netnod
Email:paf@netnod.se

[8]ページ先頭

©2009-2025 Movatter.jp