KOI8-U

From Wikipedia, the free encyclopedia
Character encoding for Ukrainian Cyrillic
KOI8-U
Languages: Ukrainian, Russian, Bulgarian
Classification: 8-bit KOI, extended ASCII
Extends: KOI8-B
Based on: KOI8-R
Other related encodings: KOI8-RU, KOI8-F

KOI8-U (RFC 2319) is an 8-bit character encoding designed to cover Ukrainian, which uses a Cyrillic alphabet. It is based on KOI8-R, which covers Russian and Bulgarian, but replaces eight box-drawing characters with the four Ukrainian letters Ґ, Є, І, and Ї in both upper case and lower case.
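The relationship to KOI8-R can be checked directly. The following is a minimal sketch that assumes Python's built-in `koi8_r` and `koi8_u` codecs as the reference mappings; the function name `koi8_differences` is illustrative and not part of any standard.

```python
# Compare Python's built-in koi8_r and koi8_u codecs byte by byte.
# Assumption: these codecs are taken as the reference mappings.

def koi8_differences():
    """Return {byte: (koi8_r_char, koi8_u_char)} for bytes that decode differently."""
    diffs = {}
    for byte in range(0x80, 0x100):  # the lower half is plain ASCII in both
        raw = bytes([byte])
        r_char = raw.decode("koi8_r")
        u_char = raw.decode("koi8_u")
        if r_char != u_char:
            diffs[byte] = (r_char, u_char)
    return diffs


for byte, (r_char, u_char) in sorted(koi8_differences().items()):
    print(f"0x{byte:02X}: KOI8-R U+{ord(r_char):04X} -> KOI8-U U+{ord(u_char):04X} {u_char}")
```

With these codecs, the loop should report exactly eight positions, one for each case of the four Ukrainian letters.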

KOI8-RU is closely related, but adds Ў for Belarusian. In both, the letter allocations match those in KOI8-E, except for Ґ, which is added to KOI8-F.

In Microsoft Windows, KOI8-U is assigned code page number 21866. IBM assigns it code page/CCSID 1168.[1][2][3]

KOI8 remains much more commonly used than ISO 8859-5, which never really caught on.[citation needed] Another common Cyrillic character encoding is Windows-1251. Both may eventually give way to Unicode.

KOI8 stands for Kod Obmena Informatsiey, 8 bit (Russian: Код Обмена Информацией, 8 бит), which means "Code for Information Exchange, 8 bit".

The KOI8 character sets place the Cyrillic letters in pseudo-Latin alphabetical order rather than in Cyrillic alphabetical order as in ISO 8859-5. As a result, if the eighth bit is stripped and the text is displayed in any ASCII-based character set, including the KOI8 sets themselves, it remains reasonably readable as a case-reversed transliteration. For instance, the expansion of the "KOI" acronym, "Код Обмена Информацией", becomes kOD oBMENA iNFORMACIEJ.
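The bit-stripping property can be reproduced in a few lines. This is a minimal sketch that assumes Python's built-in `koi8_u` codec; the helper name `strip_eighth_bit` is illustrative.

```python
def strip_eighth_bit(text: str) -> str:
    """Encode text as KOI8-U, clear the high bit of every byte, and read the result as ASCII."""
    koi8_bytes = text.encode("koi8_u")
    seven_bit = bytes(b & 0x7F for b in koi8_bytes)  # drop bit 7 of each byte
    return seven_bit.decode("ascii")


print(strip_eighth_bit("Код Обмена Информацией"))  # kOD oBMENA iNFORMACIEJ
```

Upper-case Cyrillic letters sit in the 0xE0-0xFF range and land on lower-case Latin letters (0x60-0x7F) once the high bit is cleared, which is why the transliteration comes out case-reversed.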

Character set


The following table shows the KOI8-U encoding.[1][4] Each character is shown with its equivalent Unicode code point.

KOI8-U
     0    1    2    3    4    5    6    7    8    9    A    B    C    D    E    F
0x
1x
2x   SP   !    "    #    $    %    &    '    (    )    *    +    ,    -    .    /
3x   0    1    2    3    4    5    6    7    8    9    :    ;    <    =    >    ?
4x   @    A    B    C    D    E    F    G    H    I    J    K    L    M    N    O
5x   P    Q    R    S    T    U    V    W    X    Y    Z    [    \    ]    ^    _
6x   `    a    b    c    d    e    f    g    h    i    j    k    l    m    n    o
7x   p    q    r    s    t    u    v    w    x    y    z    {    |    }    ~
8x   ─    │    ┌    ┐    └    ┘    ├    ┤    ┬    ┴    ┼    ▀    ▄    █    ▌    ▐
     2500 2502 250C 2510 2514 2518 251C 2524 252C 2534 253C 2580 2584 2588 258C 2590
9x   ░    ▒    ▓    ⌠    ■    ∙    √    ≈    ≤    ≥    NBSP ⌡    °    ²    ·    ÷
     2591 2592 2593 2320 25A0 2219 221A 2248 2264 2265 00A0 2321 00B0 00B2 00B7 00F7
Ax   ═    ║    ╒    ё    є    ╔    і    ї    ╗    ╘    ╙    ╚    ╛    ґ    ╝    ╞
     2550 2551 2552 0451 0454 2554 0456 0457 2557 2558 2559 255A 255B 0491 255D 255E
Bx   ╟    ╠    ╡    Ё    Є    ╣    І    Ї    ╦    ╧    ╨    ╩    ╪    Ґ    ╬    ©
     255F 2560 2561 0401 0404 2563 0406 0407 2566 2567 2568 2569 256A 0490 256C 00A9
Cx   ю    а    б    ц    д    е    ф    г    х    и    й    к    л    м    н    о
     044E 0430 0431 0446 0434 0435 0444 0433 0445 0438 0439 043A 043B 043C 043D 043E
Dx   п    я    р    с    т    у    ж    в    ь    ы    з    ш    э    щ    ч    ъ
     043F 044F 0440 0441 0442 0443 0436 0432 044C 044B 0437 0448 044D 0449 0447 044A
Ex   Ю    А    Б    Ц    Д    Е    Ф    Г    Х    И    Й    К    Л    М    Н    О
     042E 0410 0411 0426 0414 0415 0424 0413 0425 0418 0419 041A 041B 041C 041D 041E
Fx   П    Я    Р    С    Т    У    Ж    В    Ь    Ы    З    Ш    Э    Щ    Ч    Ъ
     041F 042F 0420 0421 0422 0423 0416 0412 042C 042B 0417 0428 042D 0429 0427 042A

Differences with KOI8-R (non-Russian letters): 0xA4, 0xA6, 0xA7, 0xAD, 0xB4, 0xB6, 0xB7, 0xBD.

Although RFC 2319 says that character 0x95 should be U+2219 (∙), it may also be U+2022 (•) to match the bullet character in Windows-1251.
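A decoder that needs to support both readings of 0x95 could expose the choice as an option. The sketch below assumes Python's built-in `koi8_u` codec, which follows the RFC mapping; the function `decode_koi8u` and its `windows_bullet` flag are hypothetical names introduced only for illustration.

```python
def decode_koi8u(data: bytes, windows_bullet: bool = False) -> str:
    """Decode KOI8-U bytes; optionally render byte 0x95 as U+2022 instead of U+2219."""
    text = data.decode("koi8_u")
    if windows_bullet:
        # In this codec only byte 0x95 decodes to U+2219, so a plain replace is enough.
        text = text.replace("\u2219", "\u2022")
    return text


rfc_char = decode_koi8u(b"\x95")
alt_char = decode_koi8u(b"\x95", windows_bullet=True)
print(f"RFC 2319 mapping: U+{ord(rfc_char):04X}")  # U+2219
print(f"Bullet variant:   U+{ord(alt_char):04X}")  # U+2022
```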

Some references have a typo and incorrectly state that character 0xB4 is U+0403, rather than the correct U+0404. This typo is present in Appendix A of RFC 2319 (but the table in the main text of the RFC gives the correct mapping).
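An implementation's mapping for 0xB4 can be checked against this note. The following sketch inspects Python's built-in `koi8_u` codec, chosen here as one example implementation; the helper `describe_byte` is an illustrative name.

```python
import unicodedata


def describe_byte(byte: int, encoding: str = "koi8_u") -> str:
    """Decode a single byte and report its Unicode code point and character name."""
    ch = bytes([byte]).decode(encoding)
    return f"0x{byte:02X} -> U+{ord(ch):04X} {unicodedata.name(ch)}"


print(describe_byte(0xB4))  # starts with "0xB4 -> U+0404"
```

A result of U+0403 here would indicate that the mapping table was derived from the erroneous Appendix A rather than from the main table of the RFC.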


References

  1. "SBCS code page information - CPGID: 01168 / Name: Ukrainian KOI8-U". IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers. IBM. C-H 3-3220-050. Archived from the original on 2017-02-18. Retrieved 2017-02-18.
  2. "CCSID information document; CCSID 1168; KOI8-U". IBM. Archived from the original on 2017-02-18. Retrieved 2017-02-18.
  3. International Components for Unicode (ICU), ibm-1168_P100-2002.ucm, 2002-12-03.
  4. Verdy, Philippe; Richter, Helmut (2016-01-04) [2008-10-13]. "KOI8-U.TXT". 2.0. Retrieved 2016-12-09.

Retrieved from "https://en.wikipedia.org/w/index.php?title=KOI8-U&oldid=1322910955"