Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Unicode collation algorithm

From Wikipedia, the free encyclopedia
String collation algorithm

TheUnicode collation algorithm (UCA) is an algorithm defined in Unicode Technical Report #10, which is a customizable method to produce binary keys fromstrings representing text in anywriting system andlanguage that can be represented withUnicode. These keys can then be efficiently compared byte by byte in order tocollate or sort them according to the rules of the language, with options for ignoring case, accents, etc.[1]

Unicode Technical Report #10 also specifies theDefault Unicode Collation Element Table (DUCET). This data file specifies a default collation ordering. The DUCET is customizable for different languages,[1][2] and some such customizations can be found in the UnicodeCommon Locale Data Repository (CLDR).[3]

An open source implementation of UCA is included with theInternational Components for Unicode, ICU.[4][5] ICU supports tailoring, and the collation tailorings from CLDR are included in ICU.[6][2]

See also

[edit]

References

[edit]
  1. ^abWhistler, Ken; Scherer, Markus;Davis, Mark (2022-08-26)."UTS #10: Unicode Collation Algorithm".Unicode. Retrieved2023-08-16.
  2. ^abHosken, Martin (2021-09-23).Unicode Sort Tailoring: Tutorial(PDF) (1.3 ed.).SIL Writing Systems Technology. pp. 2–3. Retrieved2023-08-16.
  3. ^"CLDR Releases/Downloads".Unicode CLDR. Retrieved2023-08-16.
  4. ^"ICU - International Components for Unicode".Unicode. Retrieved2023-08-16.
  5. ^"Collations".SyBooks Online. Retrieved2023-08-16.
  6. ^"Customization".ICU Documentation. Retrieved2023-08-16.

External links

[edit]

Tools

[edit]
Unicode
Code points
Characters
Special purpose
Lists
Processing
Algorithms
Comparison of encodings
On pairs of
code points
Usage
Related standards
Related topics
Scripts and symbols in Unicode
Common and
inherited scripts
Modern scripts
Ancient and
historic scripts
Notational scripts
Symbols, emojis


Stub icon

Thisalgorithms ordata structures-related article is astub. You can help Wikipedia byexpanding it.

Stub icon

Thisstandards- ormeasurement-related article is astub. You can help Wikipedia byexpanding it.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Unicode_collation_algorithm&oldid=1288180880"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp