Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

ISO/IEC 8859-2

From Wikipedia, the free encyclopedia
8-bit character set for Central and Eastern European languages in Latin script
ISO/IEC 8859-2
MIME / IANAISO-8859-2
Alias(es)iso-ir-101, csISOLatin2, latin2, l2, IBM1111
Language(see below)
StandardECMA-94:1986,ISO/IEC 8859
ClassificationExtended ASCII,ISO/IEC 8859
ExtendsUS-ASCII
Based onISO-8859-1
Other related encodingsWindows-1250,MacCroatian

ISO/IEC 8859-2:1999,Information technology — 8-bit single-byte coded graphic character sets — Part 2: Latin alphabet No. 2, is part of theISO/IEC 8859 series of ASCII-based standardcharacter encodings, first edition published in 1987. It is informally referred to as "Latin-2". It is generally intended for Central[1] or "Eastern European" languages that are written in the Latin script. Note that ISO/IEC 8859-2 is very different from code page 852 (MS-DOS Latin 2, PC Latin 2) which is also referred to as "Latin-2" in Czech and Slovak regions.[2] Almost half the use of the encoding is for Polish, and it's the main legacy encoding for Polish, while virtually all use of it has been replaced by UTF-8 (on the web).

ISO-8859-2 is theIANA preferred charset name for this standard when supplemented with theC0 and C1 control codes fromISO/IEC 6429. Less than 0.04% of all web pages use ISO-8859-2 as of October 2022.[3][4] Microsoft has assignedcode page 28592 a.k.a.Windows-28592 to ISO-8859-2 in Windows. IBM assignedcode page 912 to ISO 8859-2,[5] until that code page was extended in 1999.[6]Code page 1111 is similar, but replaces byte B0 ° (degree sign) with U+02DA ˚ (ring above).

Windows-1250 is similar to ISO-8859-2 and has all the printable characters it has and more. However a few of them are rearranged (unlikeWindows-1252, which keeps all printable characters fromISO-8859-1 in the same place).

Language coverage

[edit]

These code values can be used for the following languages:

  1. ^The missing letterÅ is officially a part of theFinnish alphabet, however it has no native use and its usage is limited to foreign names only.
  2. ^In 2017, theCouncil for German Orthography officially added a capital, but is not actually required as SS can be used instead.
  3. ^This character set unifiesȘ andȚ (S,T with commas below) withŞ andŢ (S, T withcedillas), as did virtually all other character sets including Microsoft'sWindows-1250 and the first version ofUnicode. Unicode subsequently disunified them however, this complicated processing of Romanian data; pre-existing data and input methods would still contain the older cedilla codepoints, complicating text searching.[citation needed]

Code page layout

[edit]

Differences fromISO-8859-1 have the Unicode code point number underneath.

ISO/IEC 8859-2 (Latin-2)
0123456789ABCDEF
0x
1x
2x SP !"#$%&'()*+,-./
3x0123456789:;<=>?
4x@ABCDEFGHIJKLMNO
5xPQRSTUVWXYZ[\]^_
6x`abcdefghijklmno
7xpqrstuvwxyz{|}~
8x
9x
AxNBSPĄ
0104
˘
02D8
Ł
0141
¤Ľ
013D
Ś
015A
§¨Š
0160
Ş
015E
Ť
0164
Ź
0179
SHYŽ
017D
Ż
017B
Bx°ą
0105
˛
02DB
ł
0142
´ľ
013E
ś
015B
ˇ
02C7
¸š
0161
ş
015F
ť
0165
ź
017A
˝
02DD
ž
017E
ż
017C
CxŔ
0154
ÁÂĂ
0102
ÄĹ
0139
Ć
0106
ÇČ
010C
ÉĘ
0118
ËĚ
011A
ÍÎĎ
010E
DxĐ
0110
Ń
0143
Ň
0147
ÓÔŐ
0150
Ö×Ř
0158
Ů
016E
ÚŰ
0170
ÜÝŢ
0162
ß
Exŕ
0155
áâă
0103
äĺ
013A
ć
0107
çč
010D
éę
0119
ëě
011B
íîď
010F
Fxđ
0111
ń
0144
ň
0148
óôő
0151
ö÷ř
0159
ů
016F
úű
0171
üýţ
0163
˙
02D9

See also

[edit]

References

[edit]
  1. ^"Microsoft Outlook Message Encodings". 10 January 2017.
  2. ^"The Czech and Slovak Character Encoding Mess Explained".luki.sdf-eu.org. Retrieved2022-02-27.
  3. ^"Usage Statistics and Market Share of ISO-8859-2 for Websites, October 2022".w3techs.com. Retrieved2022-10-23.
  4. ^"Historical trends in the usage statistics of character encodings for websites, February 2022".
  5. ^"Icu-data/Charset/Data/XML/Ibm-912_P100-1995.XML at main · unicode-org/Icu-data".GitHub.
  6. ^"Icu-data/Charset/Data/Ucm/Ibm-912_P100-1999.ucm at main · unicode-org/Icu-data".GitHub.

External links

[edit]
Early telecommunications
ISO/IEC 8859
Bibliographic use
National standards
ISO/IEC 2022
Mac OSCode pages
("scripts")
DOS code pages
IBM AIX code pages
Windows code pages
EBCDIC code pages
DEC terminals (VTx)
Platform specific
Unicode /ISO/IEC 10646
TeX typesetting system
Miscellaneous code pages
Control character
Related topics
Retrieved from "https://en.wikipedia.org/w/index.php?title=ISO/IEC_8859-2&oldid=1315989658"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp