![]() ISO/IEC 646 Invariant. Redlooped squares (⌘) denote national code points. Other red characters are changed in noteworthy minor modifications. | |
Standard | ISO/IEC 646,ITU T.50 |
---|---|
Classification | 7-bitBasic Latin encoding |
Preceded by | US-ASCII |
Succeeded by | ISO/IEC 8859,ISO/IEC 10646 |
Other related encoding(s) | DEC NRCS,World System Teletext Adaptations to other alphabets: ELOT 927,Symbol,KOI-7,SRPSCII and MAKSCII,ASMO 449,SI 960 |
ISO/IEC 646 is a set ofISO/IEC standards, described asInformation technology — ISO 7-bit coded character set for information interchange, and developed in cooperation withASCII at least since 1964.[1][2] Since its first edition in 1967[3] it has specified a 7-bit character code from which several national standards are derived.
ISO/IEC 646 was also ratified byECMA asECMA-6. The first version of ECMA-6 had been published in 1965,[4] based on work the ECMA's Technical Committee TC1 had carried out since December 1960.[4]
Characters in the ISO/IEC 646 Basic Character Set areinvariant characters.[5] Since that portion of ISO/IEC 646, that is theinvariant character set shared by all countries, specified only those letters used in theISO basic Latin alphabet, countries using additional letters needed to create national variants of ISO/IEC 646 to be able to use their native scripts. Since transmission and storage of 8-bit codes was not standard at the time, the national characters had to be made to fit within the constraints of 7 bits, meaning that some characters that appear inASCII do not appear in other national variants of ISO/IEC 646.
ISO/IEC 646 and its predecessorASCII (ASA X3.4) largely endorsed existing practice regarding character encodings in thetelecommunications industry.
As ASCII did not provide a number of characters needed for languages other than English, a number of national variants were made that substituted some less-used characters with needed ones. Due to the incompatibility of the various national variants, an International Reference Version (IRV) of ISO/IEC 646 was introduced, in an attempt to at least restrict the replaced set to the same characters in all variants. The original version (ISO 646 IRV) differed fromASCII only in that code point 0x24, ASCII'sdollar sign ($) was replaced by theinternational currency symbol (¤). The final 1991 version of the code ISO/IEC 646:1991 is also known asITU T.50, International Reference Alphabet or IRA, formerlyInternational Alphabet No. 5 (IA5). This standard allows users to exercise the 12 variable characters (i.e., two alternative graphic characters and 10 national defined characters). Among these exercises, ISO 646:1991 IRV (International Reference Version) is explicitly defined and identical toASCII.[6]
TheISO/IEC 8859 series of standards governing 8-bit character encodings supersede the ISO/IEC 646 international standard and its national variants, by providing 96 additional characters with the additional bit and thus avoiding any substitution of ASCII codes. TheISO/IEC 10646 standard, directly related toUnicode, supersedes all of the ISO646 and ISO/IEC 8859 sets with one unified set of character encodings using a larger 21-bit value.
A legacy of ISO/IEC 646 is visible on Windows, where in many East Asian locales thebackslash character used infilenames is rendered as¥ or other characters such as₩. Despite the fact that a different code for ¥ was available even on the original IBM PC'scode page 437, and a separate double-byte code for ¥ is available inShift JIS (although this often usesalternative mapping), so much text was created with the backslash code used for ¥ (due to Shift_JIS being officially based on ISO 646:JP, although Microsoft maps it as ASCII) that even modern Windows fonts have found it necessary to render the code that way. A similar situation exists with ₩ andEUC-KR. Another legacy is the existence oftrigraphs in theC programming language.
The following table shows the ISO/IEC 646 Invariant character set. Each character is shown with itsUnicode equivalent. National code points are gray with the ASCII character that is replaced. Yellow indicates a character that, in some regions, could be combined with a previous character as adiacritic using thebackspace character, which may affectglyph choice.
In addition to the invariant set restrictions, 0x23 is restricted to be either# or£ and 0x24 is restricted to be either$ or¤ in ECMA-6:1991, equivalent to ISO/IEC 646:1991.[11] However, these restrictions are not followed by all national variants.[14][15]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
0x | NUL | SOH | STX | ETX | EOT | ENQ | ACK | BEL | BS | HT | LF | VT | FF | CR | SO | SI |
1x | DLE | DC1 | DC2 | DC3 | DC4 | NAK | SYN | ETB | CAN | EM | SUB | ESC | FS | GS | RS | US |
2x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
6x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | DEL |
Some national variants of ISO/IEC 646 are as follows:
Code | ISO-IR | ISO/IEC ESC | Approved | National Standard | Description |
---|---|---|---|---|---|
CA | 121 | ESC 2/8 7/7 | ISO 646 | CSA Z243.4-1985-1 | Canada (No. 1 alternative, with "î") (French, classical) (Code page 1020[16]) |
CA2 | 122 | ESC 2/8 7/8 | ISO 646 | CSA Z243.4-1985-2 | Canada (No. 2 alternative, with "É") (French, reformed orthography) |
CN | 57[17] | ESC 2/8 5/4 | ? | GB/T 1988-80 | People's Republic of China (Basic Latin) |
CU | 151 | ESC 2/8 2/1 4/1 | ISO 646 | NC 99-10:81 / NC NC00-10:81 | Cuba (Spanish) |
DANO | 9-1[18] | ESC 2/8 4/5[18] | SIS | NATS-DANO | Norway andDenmark (journalistic texts). Invariant code point 0x22 is displayed as« , (compare" in the IRV). It is, however, still considered a double quotation mark.[19] Accompanies SEFI (NATS-SEFI). |
DE | 21[18][17] | ESC 2/8 4/11[18] | ISO 646 | DIN66003 | Germany (German) (Code page 1011,[20]20106[21][22][23]) |
DK | — | ? | DS 2089[24][25] | Denmark (Danish) (Code page 1017[26]) | |
ES | 17[18] | ESC 2/8 5/10[18] | ECMA | Olivetti | Spanish (international) (Code page 1023[27]) |
ES2 | 85[17] | ESC 2/8 6/8 | ECMA | IBM | Spain (Basque,Castilian,Catalan,Galician) (Code page 1014[28]) |
FI | 10[17] | ISO 646 | SFS 4017 | Finland (basic version) (Code page 1018[29]) | |
FR | 69[17] | ESC 2/8 6/6 | ISO 646 | AFNOR NF Z 62010-1982 | France (French) (Code page 1010[30]) |
FR1 | 25[18][17] | ESC 2/8 5/2[18] | ISO 646 | AFNOR NF Z 62010-1973 | France (obsolete since April 1985) (Code page 1104[31]) |
GB | 4[18][17] | ESC 2/8 4/1[18] | ISO 646 | BS 4730 | United Kingdom (English) (Code page 1013[32]) |
HU | 86 | ESC 2/8 6/9 | ISO 646 | MSZ 7795/3 | Hungary (Hungarian) |
IE | 207 | ? | NSAI 433:1996 | Ireland (Irish) | |
INV | 170 | ESC 2/8 2/1 4/2 | ISO 646 | ISO 646:1983 | Invariant subset |
(IRV) | 2[18][17] | ESC 2/8 4/0[18] | ISO 646 | ISO 646:1973 | International Reference Version. 0x7E as an overline (ISO-IR-002).[33] |
? | ? | ISO 646 | ISO 646:1983 | International Reference Version. 0x7E as a tilde (Code page 1009,[34]20105[21][22][35]). | |
ISO 646:1991 International Reference Version matches the US variant (see below). | |||||
IS | — | — | — | — | Iceland (Icelandic)De facto standard, proposed in 1978 but never formally approved. |
IT | 15[18][17] | ESC 2/8 5/9[18] | ECMA | UNI 0204-70 /Olivetti? | Italian (Code page 1012[36]) |
JP | 14[18][17] | ESC 2/8 4/10[18] | ISO 646 | JISC 6220:1969-ro | Japan (Romaji) (Code page 895[37]). Also used as an 8-bit code withthe corresponding Katakana supplementary set. |
JP-OCR-B | 92 | ESC 2/8 6/14 | ISO 646 | JIS C 6229-1984-b | Japan (OCR-B) |
KR | — | ? | KS C 5636-1989 | South Korea | |
MT | — | ? | ? | Malta (Maltese,English) | |
NL | — | ECMA | IBM | Netherlands (Dutch) (Code page 1019[38]) | |
NO | 60[17] | ESC 2/8 6/0 | ISO 646 | NS 4551 version 1[17] | Norway (Code page 1016[39]) |
NO2 | 61[17] | ESC 2/8 6/1 | ISO 646 | NS 4551 version 2[17] | Norway (obsolete since June 1987) (Code page 20108[21][22][40]) |
PL-2002 | — | — | PKN | PN-I-10050:2002[41] | Poland (current as of 2025) Set for writingPolish. Includes theEuro sign. |
PL-ZU0 | — | — | PKN | PN-T-42109-02:1984[42] | Poland (withdrawn in 2000) Set named "ZU0" for writingPolish. |
PT | 16[17] | ESC 2/8 4/12 | ECMA | Olivetti | Portuguese (international) |
PT2 | 84[17] | ESC 2/8 6/7 | ECMA | IBM | Portugal (Portuguese,Spanish) (Code page 1015[43]) |
SE | 10[18][17] | ESC 2/8 4/7[18] | ISO 646 | SEN 850200 Annex B,SIS 63 61 27 | Sweden (basicSwedish) (Code page 1018,[29] D47) |
SE2 | 11[18][17] | ESC 2/8 4/8[18] | ISO 646 | SEN 850200 Annex C,SIS 63 61 27 | Sweden (extendedSwedish for names) (Code page 20107,[21][22][44] E47) |
SEFI | 8-1[18] | ESC 2/8 4/3[18] | SIS | NATS-SEFI | Sweden andFinland (journalistic texts). Accompanies DANO (NATS-DANO). |
T.61-7bit | 102 | ESC 2/8 7/5 | ? | ITU/CCITTT.61 Recommendation | International (Teletex). Also used withthe corresponding supplementary set as an 8-bit code. |
TW | — | ? | CNS 5205-1996 | Republic of China (Taiwan) | |
US / (IRV) | 6[18][17] | ESC 2/8 4/2[18] | ISO 646 | ANSI X3.4-1968 and ISO 646:1983 (also IRV in ISO/IEC 646:1991) | United States (ASCII,Code page 367,[45]20127[21][22][46]) |
YU | 141 | ESC 2/8 7/10 | ISO 646 | JUS I.B1.002 (YUSCII) | former Yugoslavia (Croatian,Slovene,Serbian,Bosnian) |
INIS | 49 | ESC 2/8 5/7 | IAEA | INIS | ISO 646 IRV subset |
Some national character sets also exist which are based on ISO/IEC 646 but do not strictly follow its invariant set (see also§ Derivatives for other alphabets):
Character set | ISO-IR | ISO ESC | Approved | National Standard | Description |
---|---|---|---|---|---|
BS_viewdata | 47 | ESC 2/8 5/6 | British Post Office | Viewdata andTeletext. Viewdata square (⌗) substituted for normally invariant underscore (_) which cannot be displayed on the target hardware.[47] This is actually the encoding of Microsoft's WST_Engl. | |
GR / greek7 | 88 | ESC 2/8 6/10 | ? | HOSELOT 927 | Greece (withdrawn in November 1986). Uses Greek letters in place of Roman ones[48] and hence is not strictly speaking an ISO 646 variant. |
greek7-old | 18 | ESC 2/8 5/11 | ECMA | ? | Greek graphic set. Similar in concept togreek7, but uses a different mapping of letters. Also, the upper case follows the lower case. |
Latin-Greek | 19 | ESC 2/8 5/12 | ECMA | ? | Latin-Greek combined graphics (capitals only). Followsgreek7-old, but includes Latin capitals without modification, and Greek capitals over the Latin lower case. |
Latin-Greek-1 | 27[18] | ESC 2/8 5/5[18] | ECMA | Honeywell-Bull | Latin-Greek mixed graphics (Greek capitals only).[18] Visually unifies Greek capitals with Latin capitals where possible, and adds the remaining Greek capitals. Unlike the other Greek versions, all Basic Latin letters remain intact. Replaces invariant punctuation as well as national characters, however,[49] and hence is still not strictly speaking an ISO 646 variant. |
swi | — | ECMA | Olivetti | Switzerland (French,German) (Code page 1021[50]) Invariant code point 0x5F is changed from_ toè . Is aDEC NRCS variant, closely related to ISO 646, but lacks a fully ISO 646 compliant equivalent. | |
pl-ZU1 | — | — | PKN | PN-T-42109-02[42] | Poland (withdrawn in 2000) Set named "ZU1" intended for use withODRA 1300 mainframes. These use the same character set as ICT 1900 mainframes, which was based on a 1963 proposed version of ASCII prior to its standardization. |
All the variants listed above are solely graphical character sets, and are to be used with aC0 control character set such as listed in the following table:
ISO-IR | ISO ESC | Approved | Description |
---|---|---|---|
1[18] | ESC 2/1 4/0[18] | ISO 646 | ISO 646 controls[18] ("ASCII controls") |
7[18] | ESC 2/1 4/1[18] | ISO 646 | Scandinavian newspaper (NATS) controls[18] |
26[18] | ESC 2/1 4/3[18] | ISO 646 | IPTC controls[18] |
The following table lists supplementary graphical character sets defined by the same standard as specific ISO/IEC 646 variants. These would be selected by using a mechanism such asshift out or the NATS super shift (single shift),[51] or by setting the eighth bit in environments where one was available:
ISO-IR | ISO/IEC ESC | National Standard | Description |
---|---|---|---|
8-2[18] | ESC 2/8 4/4[18] | NATS-SEFI-ADD | Supplementary code used with NATS-SEFI. |
9-2[18] | ESC 2/8 4/6[18] | NATS-DANO-ADD | Supplementary code used with NATS-DANO. |
13[18][17] | ESC 2/8 4/9[18] | JISC 6220:1969-jp | Katakana, used as a supplementary code with ISO-646-JP. |
103 | ESC 2/8 7/6 | ITU/CCITTT.61 Recommendation, Supplementary Set | Supplementary code used with T.61. |
— | — | PN-T-42109-03:1986[52] | (withdrawn in 2000) Set named "ZU2" for writingPolish. Contains all letters used in Polish, including the uppercase letters missing from ZU0. Intended to be used as a supplementary set with either the IRV, ZU0, or ZU1 as the primary set. |
The specifics of the changes for some of these variants are given in the following table. Character assignments unchanged across all listed variants (i.e. which remain the same as ASCII) are not shown.
For ease of comparison, variants detailed include national variants of ISO/IEC 646, DEC's closely relatedNational Replacement Character Set (NRCS) series used onVT200 terminals, the related EuropeanWorld System Teletext encoding series defined inETS 300 706, and a few other closely related encodings based on ISO/IEC 646. Individual code charts are linked from the second column. The cells with non-white background emphasize the differences fromUS-ASCII (also theBasic Latin subset of ISO/IEC 10646 and Unicode).
Several characters could be used ascombining characters, when preceded or followed with abackspaceC0 control. This is attested in the code charts for IRV, GB, FR1, CA, and CA2, which note that"',^
would behave as thediaeresis,acute accent,cedilla, andcircumflex (rather thanquotation marks, acomma, and anupward arrowhead) when preceded or followed by a backspace. The current PL standard explicitly directs the use of the backspace and apostrophe to form Polish letters with an acute accent. The tilde character (~) was similarlyintroduced as a diacritic (˜). This encoding method originated in the typewriter/teletype era when use of backspace would overstamp a glyph, and may be considereddeprecated.
Later, when wider character sets gained more acceptance,ISO/IEC 8859, vendor-specific character sets and eventuallyUnicode became the preferred methods of coding most of these variants.
Variant Code | Code Chart | Characters for each ISO 646 / NRCS compatible or derived charset | |||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
US / IRV (1991) | ISO-IR-006[53] | ! | " | # | $ | & | : | ? | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ~ |
Older International Reference Versions | |||||||||||||||||||
IRV (1973) | ISO-IR-002[33] | ! | " | # | ¤ | & | : | ? | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ‾ |
IRV (1983) | CP01009[54] | ! | " | # | ¤ | & | : | ? | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ~ |
Invariant and other IRV subsets | |||||||||||||||||||
INV | ISO-IR-170[55] | ! | " | & | : | ? | _ | ||||||||||||
INV (NRCS)[a] | --- | ! | " | $ | & | : | ? | ||||||||||||
INV (Teletext)[a] | ETS WST[56] | ! | " | & | : | ? | |||||||||||||
INIS Subset[a] | ISO-IR-049[57] | $ | : | [ | ] | | | |||||||||||||
T.61 | ISO-IR-102[58] | ! | " | # | ¤ | & | : | ? | @ | [ | ] | _ | | | ||||||
East Asian | |||||||||||||||||||
JP | ISO-IR-014[59] | ! | " | # | $ | & | : | ? | @ | [ | ¥ | ] | ^ | _ | ` | { | | | } | ‾ |
JP-OCR-B | ISO-IR-092[60] | ! | " | # | $ | & | : | ? | @ | [ | ¥ | ] | ^ | _ | { | | | } | ||
KR | (KS X 1003)[61] | ! | " | # | $ | & | : | ? | @ | [ | ₩ | ] | ^ | _ | ` | { | | | } | ‾ |
CN | ISO-IR-057[15] | ! | " | # | ¥ | & | : | ? | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ‾ |
TW | (CNS 5205)[61] | ! | " | # | $ | & | : | ? | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ‾ |
British and Irish | |||||||||||||||||||
GB | ISO-IR-004[62] | ! | " | £ | $ | & | : | ? | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ‾ |
GB (NRCS) | CP01101[63] | ! | " | £ | $ | & | : | ? | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ~ |
Viewdata[b][c] | ISO-IR-047[47] | ! | " | £ | $ | & | : | ? | @ | ← | ½ | → | ↑ | ⌗ | ― | ¼ | ‖ | ¾ | ÷ |
IE | ISO-IR-207[64] | ! | " | £ | $ | & | : | ? | Ó | É | Í | Ú | Á | _ | ó | é | í | ú | á |
Italophone or Francophone | |||||||||||||||||||
IT[d] | ISO-IR-015[65] | ! | " | £ | $ | & | : | ? | § | ° | ç | é | ^ | _ | ù | à | ò | è | ì |
IT (Teletext)[c] | ETS WST[66] | ! | " | £ | $ | & | : | ? | é | ° | ç | → | ↑ | ⌗ | ù | à | ò | è | ì |
FR (1983) | ISO-IR-069[67] | ! | " | £ | $ | & | : | ? | à | ° | ç | § | ^ | _ | µ | é | ù | è | ¨ |
FR (1973)[d] | ISO-IR-025[68] | ! | " | £ | $ | & | : | ? | à | ° | ç | § | ^ | _ | ` | é | ù | è | ¨ |
FRTeletext[c] | ETS WST[66] | ! | " | é | ï | & | : | ? | à | ë | ê | ù | î | ⌗ | è | â | ô | û | ç |
CA[d] | ISO-IR-121[69] | ! | " | # | $ | & | : | ? | à | â | ç | ê | î | _ | ô | é | ù | è | û |
CA2 | ISO-IR-122[70] | ! | " | # | $ | & | : | ? | à | â | ç | ê | É | _ | ô | é | ù | è | û |
Francophone-Germanophone | |||||||||||||||||||
swi (NRCS)[c] | CP01021[71] | ! | " | ù | $ | & | : | ? | à | é | ç | ê | î | è | ô | ä | ö | ü | û |
Germanophone | |||||||||||||||||||
DE[d][e] | ISO-IR-021[72] | ! | " | # | $ | & | : | ? | § | Ä | Ö | Ü | ^ | _ | ` | ä | ö | ü | ß |
Nordic (Eastern) and Baltic | |||||||||||||||||||
FI / SE | ISO-IR-010[73] | ! | " | # | ¤ | & | : | ? | @ | Ä | Ö | Å | ^ | _ | ` | ä | ö | å | ‾ |
SE2[f] | ISO-IR-011[74] | ! | " | # | ¤ | & | : | ? | É | Ä | Ö | Å | Ü | _ | é | ä | ö | å | ü |
SE (NRCS) | CP01106[75] | ! | " | # | $ | & | : | ? | É | Ä | Ö | Å | Ü | _ | é | ä | ö | å | ü |
FI (NRCS) | CP01103[76] | ! | " | # | $ | & | : | ? | @ | Ä | Ö | Å | Ü | _ | é | ä | ö | å | ü |
SEFI (NATS)[g] | ISO-IR-008-1[77] | ! | " | # | $ | & | : | ? | | Ä | Ö | Å | ■ | _ | | ä | ö | å | – |
EE (Teletext)[c] | ETS WST[66] | ! | " | # | õ | & | : | ? | Š | Ä | Ö | Ž | Ü | Õ | š | ä | ö | ž | ü |
LV / LT (Teletext)[c] | ETS WST[66] | ! | " | # | $ | & | : | ? | Š | ė | ę | Ž | č | ū | š | ą | ų | ž | į |
Nordic (Western) | |||||||||||||||||||
DK | CP01017[78] | ! | " | # | ¤ | & | : | ? | @ | Æ | Ø | Å | Ü | _ | ` | æ | ø | å | ü |
DK/NO (NRCS) | CP01105[79] | ! | " | # | $ | & | : | ? | Ä | Æ | Ø | Å | Ü | _ | ä | æ | ø | å | ü |
DK/NO-alt (NRCS) | CP01107[80] | ! | " | # | $ | & | : | ? | @ | Æ | Ø | Å | ^ | _ | ` | æ | ø | å | ~ |
NO | ISO-IR-060[81] | ! | " | # | $ | & | : | ? | @ | Æ | Ø | Å | ^ | _ | ` | æ | ø | å | ‾ |
NO2 | ISO-IR-061[14] | ! | " | § | $ | & | : | ? | @ | Æ | Ø | Å | ^ | _ | ` | æ | ø | å | | |
DANO (NATS)[g][h] | ISO-IR-009-1[19] | ! | « | » | $ | & | : | ? | | Æ | Ø | Å | ■ | _ | | æ | ø | å | – |
IS | [proposed][82] | ! | " | # | ¤ | & | : | ? | Ð | Þ | ´[i] | Æ | Ö | _ | ð | þ | ´[i] | æ | ö |
Hispanophone | |||||||||||||||||||
ES[d] | ISO-IR-017[83] | ! | " | £ | $ | & | : | ? | § | ¡ | Ñ | ¿ | ^ | _ | ` | ° | ñ | ç | ~ |
ES2 | ISO-IR-085[84] | ! | " | # | $ | & | : | ? | · | ¡ | Ñ | Ç | ¿ | _ | ` | ´ | ñ | ç | ¨ |
CU | ISO-IR-151[85] | ! | " | # | ¤ | & | : | ? | @ | ¡ | Ñ | ] | ¿ | _ | ` | ´ | ñ | [ | ¨ |
Hispanophone-Lusophone | |||||||||||||||||||
ES/PTTeletext[c] | ETS WST[66] | ! | " | ç | $ | & | : | ? | ¡ | á | é | í | ó | ú | ¿ | ü | ñ | è | à |
Lusophone | |||||||||||||||||||
PT | ISO-IR-016[86] | ! | " | # | $ | & | : | ? | § | Ã | Ç | Õ | ^ | _ | ` | ã | ç | õ | ° |
PT2 | ISO-IR-084[87] | ! | " | # | $ | & | : | ? | ´ | Ã | Ç | Õ | ^ | _ | ` | ã | ç | õ | ~ |
PT (NRCS) | --- | ! | " | # | $ | & | : | ? | @ | Ã | Ç | Õ | ^ | _ | ` | ã | ç | õ | ~ |
Greek | |||||||||||||||||||
Latin-GR mixed[c] | ISO-IR-027[49] | Ξ | " | Γ | ¤ | & | Ψ | Π | Δ | Ω | Θ | Φ | Λ | Σ | ` | { | | | } | ‾ |
ISO-IR-088 (GR / ELOT 927), ISO-IR-018 and ISO-IR-019 replace Roman letters with Greek letters and aredetailed in a separate chart. | |||||||||||||||||||
Slavic (Latin script) | |||||||||||||||||||
YU | ISO-IR-141[88] | ! | " | # | $ | & | : | ? | Ž | Š | Đ | Ć | Č | _ | ž | š | đ | ć | č |
YUTeletext[c] | ETS WST[66] | ! | " | # | Ë | & | : | ? | Č | Ć | Ž | Đ | Š | ë | č | ć | ž | đ | š |
YU-altTeletext[c] | ETS WST[66] | ! | " | # | $ | & | : | ? | Č | Ć | Ž | Đ | Š | ë | č | ć | ž | đ | š |
CS/CZ/SK (Teletext)[c] | ETS WST[66] | ! | " | # | ů | & | : | ? | č | ť | ž | ý | í | ř | é | á | ě | ú | š |
PL-2002 | PN-I-10050[41] | ! | " | # | $ | & | : | ? | @ | Ą | Ę | Ł | Ż | _ | € | ą | ę | ł | ż |
PL-ZU0 | PN-T-42109-02[42] | ! | " | # | ¤ | & | : | ? | ę | ź | Ł | ń | ś | _ | ą | ó | ł | ż | ć |
PL-ZU1[c] | PN-T-42109-02[42] | ! | " | # | £ | & | : | ? | @ | [ | $ | ] | ↑ | ← | _ | ||||
PLTeletext[c] | ETS WST[66] | ! | " | # | ń | & | : | ? | ą | Ƶ | Ś | Ł | ć | ó | ę | ż | ś | ł | ź |
Adaptations for the Cyrillic script replace Roman letters and aredetailed in a separate chart | |||||||||||||||||||
Other | |||||||||||||||||||
NL | CP01019[89] | ! | " | # | $ | & | : | ? | @ | [ | \ | ] | ^ | _ | ` | { | | | } | ‾ |
NLNRCS | CP01102[90] | ! | " | £ | $ | & | : | ? | ¾ | ij | ½ | | | ^ | _ | ` | ¨ | ƒ | ¼ | ´ |
HU | ISO-IR-086[91] | ! | " | # | ¤ | & | : | ? | Á | É | Ö | Ü | ^ | _ | á | é | ö | ü | ˝ |
MT | CP03041[92] | ! | " | # | $ | & | : | ? | @ | ġ | ż | ħ | ^ | _ | ċ | Ġ | Ż | Ħ | Ċ |
RO (Teletext)[c] | ETS WST[66] | ! | " | # | ¤ | & | : | ? | Ţ | Â | Ş | Ă | Î | ı | ţ | â | ş | ă | î |
TR (VT420)[c] | [93] | ı | " | # | $ | ğ | : | ? | İ | Ş | Ö | Ç | Ü | _ | Ğ | ş | ö | ç | ü |
TR (Teletext)[c] | ETS WST[66] | ! | " | TL | ğ | & | : | ? | İ | Ş | Ö | Ç | Ü | Ğ | ı | ş | ö | ç | ü |
@
(0x40) and`
(0x60) with "Unit space A" (UA) and "Unit space B" (UB). The plain space (0x20) expands on justification. UA and UB are for fixed widths, UA must be at least as wide as UB.RFC 1345 maps UA and UB toISO 10646 (UCS) code points U+E002 and U+E003, both in thePrivate Use Area, respectively (although it also lists PUA mappings for several other characters which now have UCS code points). Unicode containsa number of space characters which might approximately correspond.TheNational Replacement Character Set (NRCS) is a family of 7-bit encodings introduced in 1983 byDEC with theVT200 series of computer terminals. It is closely related to ISO/IEC 646, being based on a similar invariant subset of ASCII, differing in retaining$
as invariant but not_
(although most NRCS variants retain the_
, and hence comply with the ISO/IEC 646 invariant set). Most NRCS variants are closely related to corresponding national ISO/IEC 646 variants where they exist, with the exception of theDutch variant.
TheEuropean telecommunications standard ETS 300 706, "Enhanced Teletext specification", defines Latin, Greek, Cyrillic, Arabic, and Hebrew code sets with several national variants for both Latin and Cyrillic.[66] Like NRCS and ISO/IEC 646, within the Latin variants, the family of encodings known as theG0 set are based on a similar invariant subset of ASCII, but do not retain either$
nor_
as invariant. Unlike NRCS, variants often differ considerably from corresponding national ISO/IEC 646 variants.
HP hascode page 1054, which adds the medium shade (▒, U+2592) at 0x7F.[94]Code page 1052 replaces a few ASCII characters from code page 1054.[95]
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
2x | SP | ! | ″ | # | $ | % | & | ′ | ( | ) | * | + | , | - | . | / |
3x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | ‗ | = | ¢ | ? |
4x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
5x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | ® | ] | © | _ |
6x | ° | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
7x | p | q | r | s | t | u | v | w | x | y | z | § | ¶ | † | ™ | ▒ |
Some 7-bit character sets for non-Latin alphabets are derived from the ISO/IEC 646 standard: these do not themselves constitute ISO/IEC 646 due to not following its invariant code points (often replacing the letters of at least one case), due to supporting differing alphabets which the set of national code points provide insufficient encoding space for. Examples include:
A comparison of some of these encodings is below. Only one case is shown, except in instances where the cases are mapped to different letters. In such instances, the mapping with the smallest code is shown first. Possible transcriptions are given for some letters; where this is omitted, the letter can be considered to correspond to the Roman one which it is mapped over.
English (ASCII) | Cyrillic alphabets | Greek alphabet | Hebrew | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Semi-transliterative | Naturally ordered | ||||||||||
Russian (KOI-7) | Russian, Bulgarian (WST RU/BG) | Ukrainian (WST UKR) | Serbian (SRPSCII) | Macedonian (MAKSCII) | Serbian, Macedonian[a] (WST SRP) | Greek (Symbol) | Greek (IR-18[97]) | Greek (ELOT 927) | Greek (WST EL) | Hebrew (SI 960) | |
@ ` | Ю (ju/yu) | Ю (ju/yu) | Ю (ju/yu) | Ж (ž) | Ж (ž) | Ч (č) | ≅ ‾ | ´ ` | @ ` | ΐ ΰ | א (ʾ/ʔ) |
A | А | А (a/á) | А | А | А | А | Α | Α | Α | Α | ב (b) |
B | Б | Б | Б | Б | Б | Б | Β | Β | Β | Β | ג (g) |
C | Ц (c/ts) | Ц (c/ts) | Ц (c/ts) | Ц (c/ts) | Ц (c/ts) | Ц (c/ts) | Χ (ch/kh) | Ψ (ps) | Γ (g) | Γ (g) | ד (d) |
D | Д | Д | Д | Д | Д | Д | Δ | Δ | Δ | Δ | ה (h) |
E | Е (je/ye) | Е (je/ye) | Е (e) | Е (e) | Е (e) | Е (e) | Ε | Ε | Ε | Ε | ו (w) |
F | Ф | Ф | Ф | Ф | Ф | Ф | Φ (ph/f) | Φ (ph/f) | Ζ (z) | Ζ (z) | ז (z) |
G | Г | Г | Г | Г | Г | Γ | Γ | Γ | Η (ē) | Η (ē) | ח (ch/kh) |
H | Х (h/kh/ch) | Х (h/kh/ch) | Х (h/kh/ch) | Х (h/kh/ch) | Х (h/kh/ch) | Х (h/kh/ch) | Η (ē) | Η (ē) | Θ (th) | Θ (th) | ט (tt) |
I | И | И | И (y) | И | И | И | Ι | Ι | Ι | Ι | י (j/y) |
J | Й (j/y) | Й (j/y) | Й (j/y) | Ј (j/y) | Ј (j/y) | Ј (j/y) | ϑ (th) ϕ (ph/f) | Ξ (x/ks) | Κ (k) | ך (k final) | |
K | К | К | К | К | К | К | Κ | Κ | Κ | Λ (l) | כ |
L | Л | Л | Л | Л | Л | Л | Λ | Λ | Λ | Μ (m) | ל |
M | М | М | М | М | М | М | Μ | Μ | Μ | Ν (n) | ם (m final) |
N | Н | Н | Н | Н | Н | Н | Ν | Ν | Ν | Ξ (x/ks) | מ (m) |
O | О | О | О | О | О | О | Ο | Ο | Ξ (x/ks) | Ο | ן (n final) |
P | П | П | П | П | П | П | Π | Π | Ο (o) | Π | נ (n) |
Q | Я (ja/ya) | Я (ja/ya) | Я (ja/ya) | Љ (lj/ly) | Љ (lj/ly) | Ќ (Ḱ/kj) | Θ (th) | ͺ ( | Π (p) | Ρ (r) | ס (s) |
R | Р | Р | Р | Р | Р | Р | Ρ | Ρ | Ρ | ʹ ς (s final) | ע (ʽ/ŋ) |
S | С | С | С | С | С | С | Σ | Σ | Σ | Σ | ף (p final) |
T | Т | Т | Т | Т | Т | Т | Τ | Τ | Τ | Τ | פ (p) |
U | У | У | У | У | У | У | Υ | Θ (th) | Υ | Υ | ץ (ṣ/ts final) |
V | Ж (ž) | Ж (ž) | Ж (ž) | В | В | В | ς (s final) ϖ (p) | Ω (ō) | Φ (f/ph) | Φ (f/ph) | צ (ṣ/ts) |
W | В (v) | В (v) | В (v) | Њ (nj/ny/ñ) | Њ (nj/ny/ñ) | Ѓ (ǵ/gj) | Ω (ō) | ς (s final) | ς (s final) | Χ (ch/kh) | ק (q) |
X | Ь (’) | Ь (’) | Ь (’) | Џ (dž) | Џ (dž) | Љ (lj/ly) | Ξ | Χ (ch/kh) | Χ (ch/kh) | Ψ (ps) | ר (r) |
Y | Ы (y/ı) | Ъ (″/ǎ/ŭ) | І (i) | Ѕ (dz) | Ѕ (dz) | Њ (nj/ny/ñ) | Ψ (ps) | Υ (u) | Ψ (ps) | Ω (ō) | ש (š/sh) |
Z | З | З | З | З | З | З | Ζ | Ζ | Ω (ō) | Ϊ | ת (t) |
[ { | Ш (š/sh) | Ш (š/sh) | Ш (š/sh) | Ш (š/sh) | Ш (š/sh) | Ћ (ć) | [ { | ῏ ῟ | [ { | Ϋ | [ { |
\ | | Э (e) | Э (e) | Є (je/ye) | Ђ (đ/dj) | Ѓ (ǵ/gj) | Ж (ž) | ∴ | | ᾿ ῾ (h) | \ | | ά ό | \ | |
] } | Щ (šč) | Щ (šč) | Щ (šč) | Ћ (ć) | Ќ (Ḱ/kj) | Ђ (đ/dj) | ] } | ῎ ῞ | ] } | έ ύ | ] } |
^ ~ | Ч (č) | Ч (č) | Ч (č) | Ч (č) | Ч (č) | Ш (š/sh) | ⊥ ~ | ˜ ¨ | ^ ‾ | ή ώ | ^ ‾ |
_ | Ъ (″) | Ы (y/ı) | Ї (ji/yi) | _ | _ | Џ (dž) | _ | _ | _ | ί | _ |
The Technical Committee TC1 ofECMA met for the first time in December 1960 to prepare standard codes for Input/Output purposes. On [30] April 1965, Standard ECMA-6 was adopted by the General Assembly of ECMA.
{{citation}}
: CS1 maint: numeric names: authors list (link){{cite book}}
: CS1 maint: multiple names: authors list (link){{cite book}}
: CS1 maint: multiple names: authors list (link){{citation}}
: CS1 maint: numeric names: authors list (link){{cite book}}
: CS1 maint: bot: original URL status unknown (link) (79 pages) including:Bemer, Robert William (1972). "a view of the history of the ISO character set".Honeywell Computer Journal.6 (4). Phoenix, Arizona, USA: Honeywell Information Systems:274–286,287–291. (13+5 pages) and many other documents and correspondence.