Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

KOI8-R

From Wikipedia, the free encyclopedia
Character encoding
KOI8-R
Alias(es)cp878 (code page 878)
LanguagesRussian,Bulgarian
Classification8-bitKOI,extended ASCII
ExtendsKOI8-B
Based onKOI-8
Other related encodingsKOI8-U,KOI8-RU

KOI8-R (RFC 1489) is an 8-bitcharacter encoding derived from theKOI-8 encoding by the programmerAndrei Chernov in 1993 and designed to coverRussian, which uses theRussian subset of aCyrillic script. KOI-8, on its turn, is an 8-bit extension of theKOI-7 encoding, which inherited aphonetic correspondence of Russian and Latin letters from theMTK-2teletype code. As a result, Russian Cyrillic letters in KOI8-R are in pseudo-Latin alphabetical order rather than the normal Cyrillic one like inISO 8859-5. Although this may seem unnatural, this has the useful effect that if the 8th bit is stripped, the text remains partially readable in anyASCII-based encoding (including KOI8-R itself) as a case-reversedtransliteration. For example, "Код для обмена и обработки информации" (the Russian meaning of the "KOI" acronym) becomeskOD DLQ OBMENA I OBRABOTKI INFORMACII.

KOI-8 stands for8-bitnyy kod dlya obmena i obrabotki informatsii (Russian:8-битный код для обмена и обработки информации) which means "8-Bit Code for Information Interchange".[1] InMicrosoft Windows, KOI8-R is assigned the code page number 20866. InIBM, KOI8-R is assigned code page 878.[2][3] KOI8-R also happens to coverBulgarian.

It lacks proper quotation marks for these languages: both «...» and the Bulgarian „...“.Windows-1251 does support these, as well as more letters, and has thus become more popular. KOI8-R is used by less than 0.004% of websites, mostly Russian and Bulgarian.[citation needed]Unicode andUTF-8 is preferred to single-byte Cyrillic encodings in modern applications, Unicode contains 436Cyrillic letters including forOld Cyrillic.

Character set

[edit]

The following table shows the KOI8-R encoding. Each character is shown with its equivalentUnicode code point.

KOI8-R[4][5][6][7]
0123456789ABCDEF
0x
1x
2x SP !"#$%&'()*+,-./
3x0123456789:;<=>?
4x@ABCDEFGHIJKLMNO
5xPQRSTUVWXYZ[\]^_
6x`abcdefghijklmno
7xpqrstuvwxyz{|}~
8x
2500

2502

250C

2510

2514

2518

251C

2524

252C

2534

253C

2580

2584

2588

258C

2590
9x
2591

2592

2593

2320

25A0

2219

221A

2248

2264

2265
NBSP
2321
°
00B0
²
00B2
·
00B7
÷
00F7
Ax
2550

2551

2552
ё
0451

2553

2554

2555

2556

2557

2558

2559

255A

255B

255C

255D

255E
Bx
255F

2560

2561
Ё
0401

2562

2563

2564

2565

2566

2567

2568

2569

256A

256B

256C
©
00A9
Cxю
044E
а
0430
б
0431
ц
0446
д
0434
е
0435
ф
0444
г
0433
х
0445
и
0438
й
0439
к
043A
л
043B
м
043C
н
043D
о
043E
Dxп
043F
я
044F
р
0440
с
0441
т
0442
у
0443
ж
0436
в
0432
ь
044C
ы
044B
з
0437
ш
0448
э
044D
щ
0449
ч
0447
ъ
044A
ExЮ
042E
А
0410
Б
0411
Ц
0426
Д
0414
Е
0415
Ф
0424
Г
0413
Х
0425
И
0418
Й
0419
К
041A
Л
041B
М
041C
Н
041D
О
041E
FxП
041F
Я
042F
Р
0420
С
0421
Т
0422
У
0423
Ж
0416
В
0412
Ь
042C
Ы
042B
З
0417
Ш
0428
Э
042D
Щ
0429
Ч
0427
Ъ
042A

See also

[edit]

References

[edit]
  1. ^(in Russian) ГОСТ 19768-74 (СТ СЭВ 358-76). Машины вычислительные и система обработки данных. Коды 8-битные для обмена и обработки информации.
  2. ^"SBCS code page information - CPGID: 00878 / Name: Russian internet koi8-r".IBM Software: Globalization: Coded character sets and related resources: Code pages by CPGID: Code page identifiers.IBM. C-H 3-3220-050.Archived from the original on 2017-02-18. Retrieved2017-02-18.
  3. ^"CCSID information document; CCSID 878; KOI8-R CYRILLIC".IBM. Retrieved2017-02-18.
  4. ^Richter, Helmut (2016-01-04) [1999-08-18]."KOI8-R.TXT". 2.0. Retrieved2016-12-09.
  5. ^Code Page CPGID 00878 (pdf)(PDF), IBM
  6. ^Code Page CPGID 00878 (txt), IBM
  7. ^International Components for Unicode (ICU), ibm-878_P100-1996.ucm, 2002-12-03

Further reading

[edit]

External links

[edit]
Multilingual
National
Russian
East Slavic
South Slavic
Other
Early telecommunications
ISO/IEC 8859
Bibliographic use
National standards
ISO/IEC 2022
Mac OSCode pages
("scripts")
DOS code pages
IBM AIX code pages
Windows code pages
EBCDIC code pages
DEC terminals (VTx)
Platform specific
Unicode /ISO/IEC 10646
TeX typesetting system
Miscellaneous code pages
Control character
Related topics
Retrieved from "https://en.wikipedia.org/w/index.php?title=KOI8-R&oldid=1322912093"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp