Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

ISO/IEC 8859-8

From Wikipedia, the free encyclopedia
International standard
ISO-8859-8: Latin/Hebrew
MIME / IANAISO-8859-8
Alias(es)iso-ir-138, hebrew, csISOLatinHebrew[1]
LanguagesHebrew,English
StandardISO/IEC 8859-8, ECMA-121, SI 1311
Classificationextended ASCII,ISO 8859
Based onDEC Hebrew (8-bit),ISO/IEC 8859-1
Other related encodingWindows-1255

ISO/IEC 8859-8,Information technology — 8-bit single-byte coded graphic character sets — Part 8: Latin/Hebrew alphabet, is part of theISO/IEC 8859 series of ASCII-based standardcharacter encodings.ISO/IEC 8859-8:1999 from 1999 represents its second and current revision, preceded by the first editionISO/IEC 8859-8:1988 in 1988. It is informally referred to asLatin/Hebrew.ISO/IEC 8859-8 covers all theHebrew letters, but noHebrew vowel signs. IBM assignedcode page 916 (CCSIDs 916 and 5012) to it.[2][3][4] This character set was also adopted byIsraeli Standard SI1311:2002, with some extensions.

ISO-8859-8 is theIANA preferred charset name for this standard when supplemented with theC0 and C1 control codes fromISO/IEC 6429. The text is (usually) in logical order, sobidi processing is required for display. NominallyISO-8859-8 (code page 28598) is for “visual order”, andISO-8859-8-I (code page 38598) is for logical order. But usually in practice, and required for XML documents,[citation needed]ISO-8859-8 also stands for logical order text. TheWHATWG Encoding Standard used byHTML5 treats ISO-8859-8 andISO-8859-8-I as distinct encodings with the same mapping due to influence on the layout direction, but notes that this no longer applies toISO-8859-6 (Arabic), only to ISO-8859-8.[5]

There is alsoISO-8859-8-E which supposedly requires directionality to be explicitly specified with special control characters; this latter variant is in practice unused.

TheMicrosoft Windows code page for Hebrew,Windows-1255, is mostly an extension of ISO/IEC 8859-8 without C1 controls, except for the omission of the double underscore, and replacement of the generic currency sign (¤) with thesheqel sign (₪). It adds support forvowel points as combining characters, and some additional punctuation.

Over a decade after the publication of that standard,Unicode is preferred, at least for the Internet[6] (meaningUTF-8, the dominant encoding for web pages). ISO-8859-8 is used by less than 0.1% of websites.[7]

Code page layout

[edit]
ISO/IEC 8859-8[8][9][10][11]
0123456789ABCDEF
0x
1x
2x SP !"#$%&'()*+,-./
3x0123456789:;<=>?
4x@ABCDEFGHIJKLMNO
5xPQRSTUVWXYZ[\]^_
6x`abcdefghijklmno
7xpqrstuvwxyz{|}~
8x
9x
AxNBSP¢£¤¥¦§¨©×«¬SHY®¯
Bx°±²³´µ·¸¹÷»¼½¾
Cx
Dx
Exאבגדהוזחטיךכלםמן
FxנסעףפץצקרשתLRMRLM
  Different from bothDEC Hebrew (8-bit) andISO-8859-1.

FD is left-to-right mark (U+200E) and FE is right-to-left mark (U+200F), as specified in a newer amendment as ISO/IEC 8859-8:1999.

2002 Israeli Standard extensions

[edit]

Israeli Standard SI1311:2002 matches ISO/IEC 8859-8:1999 except for a number of additional character allocations for theeuro sign,new shekel sign and more advancedexplicit bidirectional formatting.[12]

SI1311:2002[12]
0123456789ABCDEF
DxLRORLOPDF
Exאבגדהוזחטיךכלםמן
FxנסעףפץצקרשתLRERLELRMRLM
  Absent from ISO/IEC 8859-8:1999, added in SI1311:2002.

See also

[edit]

References

[edit]
  1. ^Character Sets,Internet Assigned Numbers Authority (IANA), 2018-12-12
  2. ^"Code page 916 information document". Archived fromthe original on 2017-02-16.
  3. ^"CCSID 916 information document". Archived fromthe original on 2014-11-29.
  4. ^"CCSID 5012 information document". Archived fromthe original on 2016-03-27.
  5. ^van Kesteren, Anne."9. Legacy single-byte encodings".Encoding Standard.WHATWG.Note: ISO-8859-8 and ISO-8859-8-I are distinct encoding names, because ISO-8859-8 has influence on the layout direction. And although historically this might have been the case for ISO-8859-6 and "ISO-8859-6-I" as well, that is no longer true.
  6. ^John, Nicholas A. (2013)."The Construction of the Multilingual Internet: Unicode, Hebrew, and Globalization".Journal of Computer-Mediated Communication.18 (3):321–338.doi:10.1111/jcc4.12015.ISSN 1083-6101.Background: the problem of Hebrew and the Internet
  7. ^"Usage Statistics of ISO-8859-8 for Websites, January 2019".w3techs.com. Retrieved2019-01-17.
  8. ^Code Page CPGID 00916 (pdf)(PDF), IBM
  9. ^Code Page CPGID 00916 (txt), IBM
  10. ^International Components for Unicode (ICU), ibm-916_P100-1995.ucm, 2002-12-03
  11. ^International Components for Unicode (ICU), ibm-5012_P100-1999.ucm, 2002-12-03
  12. ^abStandards Institution of Israel.ISO-IR-234: Latin/Hebrew character set for 8-bit codes(PDF). ITSCJ/IPSJ.

External links

[edit]
EBCDIC
DEC
ISO/IEC
DOS
Windows
Macintosh
Atari
National
Multibyte
Early telecommunications
ISO/IEC 8859
Bibliographic use
National standards
ISO/IEC 2022
Mac OSCode pages
("scripts")
DOS code pages
IBM AIX code pages
Windows code pages
EBCDIC code pages
DEC terminals (VTx)
Platform specific
Unicode /ISO/IEC 10646
TeX typesetting system
Miscellaneous code pages
Control character
Related topics
1–9999
10000–19999
20000–29999
30000+
Retrieved from "https://en.wikipedia.org/w/index.php?title=ISO/IEC_8859-8&oldid=1242296778"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp