MIME / IANA | windows-1251 |
---|---|
Alias(es) | cp1251 (Code page 1251) |
Language(s) | Russian,Ukrainian,Belarusian,Bulgarian,Serbian Cyrillic,Bosnian Cyrillic,Macedonian,Rotokas,Rusyn,English |
Created by | Microsoft |
Standard | WHATWG Encoding Standard |
Classification | extended ASCII,Windows-125x |
Other related encoding(s) | Amiga-1251,KZ-1048, RFC 1345's "ECMA-Cyrillic" |
Windows-1251 is an 8-bitcharacter encoding, designed to cover languages that use theCyrillic script such asRussian,Ukrainian,Belarusian,Bulgarian,Serbian Cyrillic,Macedonian and other languages.
On the web, it is the second most-used single-byte character encoding (or third most-used character encoding overall), and most used of the single-byte encodings supporting Cyrillic. As of January 2024[update], 0.3% of allwebsites use Windows-1251.[1][2] It's by far mostly used for Russian, while a small minority of Russian websites use it, with 94.6% of Russian (.ru) websites usingUTF-8,[3][4][5] and the legacy 8-bit encoding is distant second. In Linux, the encoding is known as cp1251.[6]IBM uses code page 1251 (CCSID 1251 andeuro sign extended CCSID 5347) for Windows-1251.[7][8][9][10][11][12][13]
Windows-1251 andKOI8-R (or itsUkrainian variantKOI8-U) are much more commonly used thanISO 8859-5 (which is used by less than 0.0004% of websites).[14] In contrast toWindows-1252 andISO 8859-1, Windows-1251 is not closely related to ISO 8859-5.
Unicode (e.g. UTF-8) is preferred to Windows-1251 or other Cyrillic encodings in modern applications, especially on the Internet, making UTF-8 the dominant encoding for web pages. (For further discussion of Unicode's complete coverage, of 436 Cyrillic letters/code points, including forOld Cyrillic, and how single-byte character encodings, such as Windows-1251 andKOI8-R, cannot provide this, seeCyrillic script in Unicode.)
The following table shows Windows-1251. Each character is shown with itsUnicode equivalent and itsAlt code.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
1x | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | DEL |
8x | Ђ | Ѓ | ‚ | ѓ | „ | … | † | ‡ | € | ‰ | Љ | ‹ | Њ | Ќ | Ћ | Џ |
9x | ђ | ‘ | ’ | “ | ” | • | – | — | ™ | љ | › | њ | ќ | ћ | џ | |
Ax | NBSP | Ў | ў | Ј | ¤ | Ґ | ¦ | § | Ё | © | Є | « | ¬ | SHY | ® | Ї |
Bx | ° | ± | І | і | ґ | µ | ¶ | · | ё | № | є | » | ј | Ѕ | ѕ | ї |
Cx | А | Б | В | Г | Д | Е | Ж | З | И | Й | К | Л | М | Н | О | П |
Dx | Р | С | Т | У | Ф | Х | Ц | Ч | Ш | Щ | Ъ | Ы | Ь | Э | Ю | Я |
Ex | а | б | в | г | д | е | ж | з | и | й | к | л | м | н | о | п |
Fx | р | с | т | у | ф | х | ц | ч | ш | щ | ъ | ы | ь | э | ю | я |
An altered version of Windows-1251 was standardised inKazakhstan as Kazakh standard STRK1048, and is known by the labelKZ-1048
. It differs in the rows shown below:
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
8x | Ђ | Ѓ | ‚ | ѓ | „ | … | † | ‡ | € | ‰ | Љ | ‹ | Њ | Қ | Һ | Џ |
9x | ђ | ‘ | ’ | “ | ” | • | – | — | ™ | љ | › | њ | қ | һ | џ | |
Ax | NBSP | Ұ | ұ | Ә | ¤ | Ө | ¦ | § | Ё | © | Ғ | « | ¬ | SHY | ® | Ү |
Bx | ° | ± | І | і | ө | µ | ¶ | · | ё | № | ғ | » | ә | Ң | ң | ү |
Code Page 1174 is another variant created for theKazakh language, which matches Windows-1251 for theRussian subset of the Cyrillic letters. It differs from KZ-1048 by moving the Cyrillic letterShha from 8E/9E to 8A/9A.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
8x | Ђ | Ѓ | ‚ | ѓ | „ | … | † | ‡ | € | ‰ | Һ | ‹ | Њ | Қ | Ћ | Џ |
9x | ђ | ‘ | ’ | “ | ” | • | – | — | ™ | һ | › | њ | қ | ћ | џ | |
Ax | NBSP | Ұ | ұ | Ә | ¤ | Ө | ¦ | § | Ё | © | Ғ | « | ¬ | SHY | ® | Ү |
Bx | ° | ± | І | і | ө | µ | ¶ | · | ё | № | ғ | » | ә | Ң | ң | ү |
Windows Latvian + Russian is a modification of Windows-1251 to support theLatvian language.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
8x | Ū | Ģ | ‚ | ō | „ | … | † | ‡ | Ž | ‰ | Š | ‹ | Ē | Ķ | Č | ģ |
9x | ū | ‘ | ’ | “ | ” | • | – | — | ž | ™ | š | › | ē | ķ | č | Ō |
Ax | NBSP | Ā | ā | Ļ | ¤ | ļ | ¦ | § | Ё | © | Ņ | « | ¬ | SHY | ® | ¯ |
Bx | ° | ± | Ī | ī | ´ | µ | ¶ | · | ё | № | ņ | » | ¼ | ½ | ¾ | × |
Windows Cyrillic + Finnish is a modification of Windows-1251 that was used byParatype to cover theFinnish language. This encoding is supported byFontLab Studio 5.[18] This variant is missing the lettersŠ andŽ which are used in loanwords in Finnish and can be replaced by the digraphs SH and ZH.
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
8x | Ђ | Ѓ | ‚ | ѓ | „ | … | † | ‡ | ˆ | ‰ | Љ | ‹ | Њ | Ќ | Ћ | Џ |
9x | ђ | ‘ | ’ | “ | ” | • | – | — | ˜ | ™ | љ | › | њ | ќ | ћ | џ |
Ax | NBSP | Ў | ў | Ó | ¤ | Ґ | ¦ | § | Ё | © | Ä | « | ¬ | SHY | ® | Ö |
Bx | ° | ± | Å | å | ґ | µ | ¶ | · | ё | № | ä | » | ó | É | é | ö |
MIME / IANA | Amiga-1251 |
---|---|
Alias(es) | Ami1251 |
Language(s) | English,Russian |
Classification | extended ASCII |
Based on | Windows-1251,ISO-8859-1,ISO-8859-15 |
RussianAmiga OS systems used a version of code page 1251 which matches Windows-1251 for theRussian subset of the Cyrillic letters, but otherwise mostly followsISO-8859-1. This version is known asAmiga-1251,[19] under which name it is registered with theIANA.[20]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
8x | XXX | XXX | BPH | NBH | IND | NEL | SSA | ESA | HTS | HTJ | VTS | PLD | PLU | RI | SS2 | SS3 |
9x | DCS | PU1 | PU2 | STS | CCH | MW | SPA | EPA | SOS | XXX | SCI | CSI | ST | OSC | PM | APC |
Ax | NBSP | ¡ | ¢ | £ | €[a] | ¥ | ¦ | § | Ё | © | №[b] | « | ¬ | SHY | ® | ¯ |
Bx | ° | ± | ² | ³ | ´ | µ | ¶ | · | ё | ¹ | º | » | ¼ | ½ | ¾ | ¿ |