Windows-1256

From Wikipedia, the free encyclopedia
Windows character set for Arabic
MIME / IANA: windows-1256
Alias(es): cp1256 (Code page 1256)
Languages: Arabic, Persian, Urdu, English, French (except capital letters with diacritics)
Created by: Microsoft
Standard: WHATWG Encoding Standard
Classification: extended ASCII, Windows-125x

Windows-1256 is a code page used under Microsoft Windows to write Arabic and other languages that use the Arabic script, such as Persian and Urdu.

This code page is compatible with neither ISO/IEC 8859-6 nor the MacArabic encoding.
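The incompatibility can be illustrated with a minimal Python sketch. It assumes Python's built-in `cp1256` and `iso8859_6` codecs as stand-ins for the two encodings; the sample word is a hypothetical test string, not taken from the article.

```python
# Encode an Arabic word as Windows-1256, then misread the bytes as ISO/IEC 8859-6.
# The sample word (seen, lam, alef, meem) is written with escapes to keep the
# source free of bidirectional-text surprises.
word = "\u0633\u0644\u0627\u0645"

raw = word.encode("cp1256")        # "cp1256" is Python's codec name for Windows-1256
misread = raw.decode("iso8859_6")  # same bytes, interpreted with the wrong table

print(raw.hex(" "))      # the four bytes, one per letter
print(misread == word)   # False: the two tables disagree on some of these bytes
```

In this sketch the first and third letters happen to survive, because the two encodings share some positions in the 0xC1-0xD6 range, while the others come back as different letters.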

Windows-1256 encodes every abstract letter of the basic Arabic alphabet, not every concrete visual form (isolated, initial, medial, final, or ligatured); that is, it encodes characters, not glyphs. The Arabic letters in the C0-FF range are in Arabic alphabetic order, but some Latin characters are interspersed among them. These are Windows-1252 Latin characters used for French, a language with historic relevance in former French colonies in North Africa such as Morocco and Algeria. This allows French and Arabic text to be intermixed in Windows-1256 without any code-page switching, although upper-case letters with diacritics are not included.
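A short Python sketch of these properties follows. It assumes Python's built-in `cp1256` codec; `can_encode` is a hypothetical helper, and the sample strings are illustrative only.

```python
def can_encode(text: str) -> bool:
    """Return True if every character of text has a Windows-1256 byte."""
    try:
        text.encode("cp1256")
        return True
    except UnicodeEncodeError:
        return False

# Lower-case French letters and Arabic letters share one code page:
# "cafe" with e-acute, a space, then the Arabic word for coffee.
mixed = "caf\u00e9 \u0642\u0647\u0648\u0629"
print(can_encode(mixed))            # True

# Upper-case letters with diacritics are not in the code page.
print(can_encode("\u00c9cole"))     # False (capital E with acute)

# Presentation forms (glyph variants) are not in it either; only the
# abstract letter is. U+FE8E is ARABIC LETTER ALEF FINAL FORM.
print(can_encode("\ufe8e"))         # False
```

Because the encoding is single-byte, the encoded form of `mixed` has exactly as many bytes as the string has characters.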

IBM uses code page 1256 (CCSID 1256, the euro-sign-extended CCSID 5352, and the further extended CCSID 9448, which adds some letters used in modern Persian and Urdu) for Windows-1256.[1][2][3][4]

Unicode is preferred over Windows-1256 in modern applications, especially on the Internet, where UTF-8 is the dominant encoding for web pages, including Arabic ones (see also Arabic script in Unicode, which gives complete coverage, unlike Windows-1256 or ISO/IEC 8859-6, which do not cover the extra characters). As of October 2022, fewer than 0.03% of all web pages used Windows-1256.[5][6] Although the encoding is used mostly for Arabic and is the second-most popular encoding for that language, it accounts for only 1.6% of the Arabic text on the web.
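Migrating legacy data to UTF-8 is a decode followed by an encode. The sketch below assumes Python's built-in `cp1256` codec; `cp1256_to_utf8` and the sample bytes are hypothetical names and data chosen for illustration.

```python
def cp1256_to_utf8(data: bytes) -> bytes:
    """Transcode Windows-1256 bytes to UTF-8 via Unicode text."""
    return data.decode("cp1256").encode("utf-8")

legacy = b"\xd3\xe1\xc7\xe3"          # four Arabic letters, one byte each
converted = cp1256_to_utf8(legacy)

print(len(legacy), len(converted))    # Arabic letters take two bytes each in UTF-8
```

The UTF-8 form is larger for Arabic text, but it can represent characters that Windows-1256 cannot.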

Character set

The original specification of the code page left nine byte values marked as "NOT USED" (hexadecimal 0x80, 0x8A, 0x8F, 0x98, 0x9A, 0x9F, 0xAA, 0xC0, and 0xFF).[7] These bytes were later assigned to the euro sign and to additional letters of the Perso-Arabic script (for the Persian and Urdu languages).[8]
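The later assignments can be inspected with a small Python sketch. It assumes that Python's built-in `cp1256` codec follows the extended table; `LATE_ASSIGNED` and `decoded` are hypothetical names introduced here.

```python
import unicodedata

# The nine byte values that the original specification marked "NOT USED".
LATE_ASSIGNED = [0x80, 0x8A, 0x8F, 0x98, 0x9A, 0x9F, 0xAA, 0xC0, 0xFF]

# Decode each byte on its own with the extended Windows-1256 table.
decoded = {b: bytes([b]).decode("cp1256") for b in LATE_ASSIGNED}

for byte_value, char in decoded.items():
    # Print the byte, its Unicode code point, and the official character name.
    print(f"0x{byte_value:02X} -> U+{ord(char):04X} {unicodedata.name(char)}")
```

The first entry is the euro sign; the remaining eight are letters used in Persian and Urdu.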

The following table shows the extended version of Windows-1256. Each row label gives the high hexadecimal digit of the byte value and each column heading gives the low digit.

Here every Arabic letter is shown in isolated form. The actual forms of the letters inside Arabic words are rendered by a combination of software rules and appropriate font support.

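Windows-1256[8][9][10][11][12][13][14]
    0    1    2    3    4    5    6    7    8    9    A    B    C    D    E    F
0x  NUL  SOH  STX  ETX  EOT  ENQ  ACK  BEL  BS   HT   LF   VT   FF   CR   SO   SI
1x  DLE  DC1  DC2  DC3  DC4  NAK  SYN  ETB  CAN  EM   SUB  ESC  FS   GS   RS   US
2x  SP   !    "    #    $    %    &    '    (    )    *    +    ,    -    .    /
3x  0    1    2    3    4    5    6    7    8    9    :    ;    <    =    >    ?
4x  @    A    B    C    D    E    F    G    H    I    J    K    L    M    N    O
5x  P    Q    R    S    T    U    V    W    X    Y    Z    [    \    ]    ^    _
6x  `    a    b    c    d    e    f    g    h    i    j    k    l    m    n    o
7x  p    q    r    s    t    u    v    w    x    y    z    {    |    }    ~    DEL
8x  €    پ    ‚    ƒ    „    …    †    ‡    ˆ    ‰    ٹ    ‹    Œ    چ    ژ    ڈ
9x  گ    ‘    ’    “    ”    •    –    —    ک    ™    ڑ    ›    œ    ZWNJ ZWJ  ں
Ax  NBSP ،    ¢    £    ¤    ¥    ¦    §    ¨    ©    ھ    «    ¬    SHY  ®    ¯
Bx  °    ±    ²    ³    ´    µ    ¶    ·    ¸    ¹    ؛    »    ¼    ½    ¾    ؟
Cx  ہ    ء    آ    أ    ؤ    إ    ئ    ا    ب    ة    ت    ث    ج    ح    خ    د
Dx  ذ    ر    ز    س    ش    ص    ض    ×    ط    ظ    ع    غ    ـ    ف    ق    ك
Ex  à    ل    â    م    ن    ه    و    ç    è    é    ê    ë    ى    ي    î    ï
Fx  ً    ٌ    ٍ    َ    ô    ُ    ِ    ÷    ّ    ù    ْ    û    ü    LRM  RLM  ے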
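The byte values at which Windows-1256 departs from Windows-1252 can be computed rather than read off the table. This is a minimal sketch assuming Python's built-in `cp1256` and `cp1252` codecs; `differing_bytes` is a hypothetical helper name.

```python
def differing_bytes() -> list[int]:
    """Return the byte values that decode differently in cp1256 and cp1252."""
    diffs = []
    for value in range(256):
        one = bytes([value])
        arabic = one.decode("cp1256")
        # Python's cp1252 leaves a few bytes undefined; errors="replace"
        # turns those into U+FFFD so that they count as differences.
        latin = one.decode("cp1252", errors="replace")
        if arabic != latin:
            diffs.append(value)
    return diffs

diffs = differing_bytes()
print(" ".join(f"{value:02X}" for value in diffs))
```

All differences fall in the upper half (0x80 and above). Shared positions such as 0xE9 (é) are what allow lower-case French text to pass through unchanged.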

References

  1. "Code page 1256 information document". Archived from the original on 2016-03-03.
  2. "CCSID 1256 information document". Archived from the original on 2016-03-27.
  3. "CCSID 5352 information document". Archived from the original on 2014-11-29.
  4. "CCSID 9448 information document". Archived from the original on 2014-11-29.
  5. "Historical trends in the usage of character encodings for websites, October 2022". w3techs.com.
  6. "Frequently Asked Questions". w3techs.com.
  7. Archiveddocs. "Code Page 1256 Windows Arabic". docs.microsoft.com.
  8. "cp1256 to Unicode table" (PDF). www.unicode.org. Retrieved 2019-05-31.
  9. Unicode mappings of windows 1256 with "best fit".
  10. Code Page CPGID 01256 (pdf) (PDF), IBM.
  11. Code Page CPGID 01256 (txt), IBM.
  12. International Components for Unicode (ICU), ibm-1256_P110-1997.ucm, 2002-12-03.
  13. International Components for Unicode (ICU), ibm-5352_P100-1998.ucm, 2002-12-03.
  14. International Components for Unicode (ICU), ibm-9448_X100-2005.ucm, 2005-11-15.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Windows-1256&oldid=1277922403"