Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Windows-1251

From Wikipedia, the free encyclopedia
(Redirected fromCode page 1174)
Windows character set for Cyrillic alphabet
Windows-1251
MIME / IANAwindows-1251
Alias(es)cp1251 (Code page 1251)
Language(s)Russian,Ukrainian,Belarusian,Bulgarian,Serbian Cyrillic,Bosnian Cyrillic,Macedonian,Rotokas,Rusyn,English
Created byMicrosoft
StandardWHATWG Encoding Standard
Classificationextended ASCII,Windows-125x
Other related encoding(s)Amiga-1251,KZ-1048,
RFC 1345's "ECMA-Cyrillic"

Windows-1251 is an 8-bitcharacter encoding, designed to cover languages that use theCyrillic script such asRussian,Ukrainian,Belarusian,Bulgarian,Serbian Cyrillic,Macedonian and other languages.

On the web, it is the second most-used single-byte character encoding (or third most-used character encoding overall), and most used of the single-byte encodings supporting Cyrillic. As of January 2024[update], 0.3% of allwebsites use Windows-1251.[1][2] It's by far mostly used for Russian, while a small minority of Russian websites use it, with 94.6% of Russian (.ru) websites usingUTF-8,[3][4][5] and the legacy 8-bit encoding is distant second. In Linux, the encoding is known as cp1251.[6]IBM uses code page 1251 (CCSID 1251 andeuro sign extended CCSID 5347) for Windows-1251.[7][8][9][10][11][12][13]

Windows-1251 andKOI8-R (or itsUkrainian variantKOI8-U) are much more commonly used thanISO 8859-5 (which is used by less than 0.0004% of websites).[14] In contrast toWindows-1252 andISO 8859-1, Windows-1251 is not closely related to ISO 8859-5.

Unicode (e.g. UTF-8) is preferred to Windows-1251 or other Cyrillic encodings in modern applications, especially on the Internet, making UTF-8 the dominant encoding for web pages. (For further discussion of Unicode's complete coverage, of 436 Cyrillic letters/code points, including forOld Cyrillic, and how single-byte character encodings, such as Windows-1251 andKOI8-R, cannot provide this, seeCyrillic script in Unicode.)

Character set

[edit]

The following table shows Windows-1251. Each character is shown with itsUnicode equivalent and itsAlt code.

Windows-1251[15]
0123456789ABCDEF
0xNULSOHSTXETXEOTENQACKBELBSHTLFVTFFCRSOSI
1xDLEDC1DC2DC3DC4NAKSYNETBCANEMSUBESCFSGSRSUS
2x SP !"#$%&'()*+,-./
3x0123456789:;<=>?
4x@ABCDEFGHIJKLMNO
5xPQRSTUVWXYZ[\]^_
6x`abcdefghijklmno
7xpqrstuvwxyz{|}~DEL
8xЂЃѓЉЊЌЋЏ
9xђљњќћџ
AxNBSPЎўЈ¤Ґ¦§Ё©Є«¬SHY®Ї
Bx°±Ііґµ·ёє»јЅѕї
CxАБВГДЕЖЗИЙКЛМНОП
DxРСТУФХЦЧШЩЪЫЬЭЮЯ
Exабвгдежзийклмноп
Fxрстуфхцчшщъыьэюя
  Differences fromWindows-1252

Kazakh variants

[edit]

An altered version of Windows-1251 was standardised inKazakhstan as Kazakh standard STRK1048, and is known by the labelKZ-1048. It differs in the rows shown below:

KZ-1048 (STRK1048-2002)[16]
0123456789ABCDEF
8xЂЃѓЉЊҚҺЏ
9xђљњқһџ
AxNBSPҰұӘ¤Ө¦§Ё©Ғ«¬SHY®Ү
Bx°±Ііөµ·ёғ»әҢңү
  Differences from Windows-1251

Code Page 1174 is another variant created for theKazakh language, which matches Windows-1251 for theRussian subset of the Cyrillic letters. It differs from KZ-1048 by moving the Cyrillic letterShha from 8E/9E to 8A/9A.

Code page 1174[17]
0123456789ABCDEF
8xЂЃѓҺЊҚЋЏ
9xђһњқћџ
AxNBSPҰұӘ¤Ө¦§Ё©Ғ«¬SHY®Ү
Bx°±Ііөµ·ёғ»әҢңү
  Different from Windows-1251

Latvian variant

[edit]

Windows Latvian + Russian is a modification of Windows-1251 to support theLatvian language.

Windows Latvian + Russian
0123456789ABCDEF
8xŪĢōŽŠĒĶČģ
9xūžšēķčŌ
AxNBSPĀāĻ¤ļ¦§Ё©Ņ«¬SHY®¯
Bx°±Īī´µ·ёņ»¼½¾×
  Differences from Windows-1251

Finnish variant

[edit]

Windows Cyrillic + Finnish is a modification of Windows-1251 that was used byParatype to cover theFinnish language. This encoding is supported byFontLab Studio 5.[18] This variant is missing the lettersŠ andŽ which are used in loanwords in Finnish and can be replaced by the digraphs SH and ZH.

Windows Cyrillic + Finnish
0123456789ABCDEF
8xЂЃѓˆЉЊЌЋЏ
9xђ˜ љњќћџ
AxNBSPЎўÓ¤Ґ¦§Ё©Ä«¬SHY®Ö
Bx°±Ååґµ·ёä»óÉéö
  Differences from Windows-1251

Amiga variant

[edit]
Amiga-1251
MIME / IANAAmiga-1251
Alias(es)Ami1251
Language(s)English,Russian
Classificationextended ASCII
Based onWindows-1251,ISO-8859-1,ISO-8859-15

RussianAmiga OS systems used a version of code page 1251 which matches Windows-1251 for theRussian subset of the Cyrillic letters, but otherwise mostly followsISO-8859-1. This version is known asAmiga-1251,[19] under which name it is registered with theIANA.[20]

Amiga-1251[19]
0123456789ABCDEF
8xXXXXXXBPHNBHINDNELSSAESAHTSHTJVTSPLDPLURISS2SS3
9xDCSPU1PU2STSCCHMWSPAEPASOSXXXSCICSISTOSCPMAPC
AxNBSP¡¢£[a]¥¦§Ё©[b]«¬SHY®¯
Bx°±²³´µ·ё¹º»¼½¾¿
  Different from Windows-1251 to match ISO-8859-1
  Different from both Windows-1251 and ISO-8859-1
  1. ^MatchingISO-8859-15; at a different location than in Windows-1251
  2. ^Present in Windows-1251, but in a different location (absent from ISO-8859-1/15)

References

[edit]

See also

[edit]

References

[edit]
  1. ^"Historical trends in the usage of character encodings, January 2024". Retrieved2024-01-01.
  2. ^"Frequently Asked Questions".
  3. ^"Distribution of Character Encodings among websites that use .ru".w3techs.com. Retrieved2024-01-01.
  4. ^"Distribution of Character Encodings among websites that use Russian".w3techs.com. Retrieved2023-01-16.
  5. ^"Distribution of Character Encodings among websites that use Russian Federation".w3techs.com. Retrieved2021-11-05.
  6. ^"cp1251(7) - Linux manual page".man7.org. Retrieved2018-07-01.
  7. ^"Code page 1251 information document". Archived fromthe original on 2016-03-03.
  8. ^"CCSID 1251 information document". Archived fromthe original on 2014-11-29.
  9. ^"CCSID 5347 information document". Archived fromthe original on 2014-11-29.
  10. ^Code Page CPGID 01251 (pdf)(PDF), IBM
  11. ^Code Page CPGID 01251 (txt), IBM
  12. ^International Components for Unicode (ICU), ibm-1251_P100-1995.ucm, 2002-12-03
  13. ^International Components for Unicode (ICU), ibm-5347_P100-1998.ucm, 2002-12-03
  14. ^"Usage Statistics of Character Encodings for Websites".w3techs.com.Archived from the original on 2012-05-30.
  15. ^Steele, Shawn (1998).CP1251 to Unicode table.Unicode Consortium. CP1251.TXT.
  16. ^Whistler, Ken (2007).KZ-1048 to Unicode.Unicode Consortium. KZ1048.TXT.
  17. ^ibm-1174_X100-2007.ucm, IBM
  18. ^"FontLab Studio 5. Classic pro font editor for Mac & Windows".
  19. ^abMalyshev, Michael (2003). "Amiga-1251 to Unicode table".Registration of new charset [Amiga-1251]. IANA.
  20. ^"Character Sets". IANA.

Further reading

[edit]
  • Kornai, Andras; Birnbaum, David J.; da Cruz, Frank; Davis, Bur; Fowler, George; Paine, Richard B.; Paperno, Slava; Simonsen, Keld J.; Thobe, Glenn E.; Vulis, Dimitri; van Wingen, Johan W. (1993-03-13)."CYRILLIC ENCODING FAQ Version 1.3". Retrieved2020-06-24.

External links

[edit]
Multilingual
National
Russian
East Slavic
South Slavic
Other
Early telecommunications
ISO/IEC 8859
Bibliographic use
National standards
ISO/IEC 2022
Mac OSCode pages
("scripts")
DOS code pages
IBM AIX code pages
Windows code pages
EBCDIC code pages
DEC terminals (VTx)
Platform specific
Unicode /ISO/IEC 10646
TeX typesetting system
Miscellaneous code pages
Control character
Related topics
Retrieved from "https://en.wikipedia.org/w/index.php?title=Windows-1251&oldid=1280437867#Code_page_1174"
Category:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp