"ibm-1050" redirects here. For the IBM Data Communications System, seeIBM 1050.
"ibm-1051" redirects here. For the IBM Central Control Unit, seeIBM 1051.
In computingHP Roman is a family ofcharacter sets consisting ofHP Roman Extension,HP Roman-8,HP Roman-9 and several variants. Originally introduced byHewlett-Packard around 1978, revisions and adaptations were published several times up to 1999. The 1985 revisions were later standardized asIBMcodepages 1050 and 1051. Supporting many European languages,[which?] the character sets were used by various HP workstations, terminals, calculators as well as many printers, also from third-parties.
HP Roman is a family ofsingle bytecharacter encodings supporting several Latin script based languages of Europe. It was originally introduced byHewlett-Packard around 1978 as 7- and 8-bitHP Roman Extension for some of theircomputer terminals andprinters. Early versions of the 8-bit variant were also used by some HP workstations in 1978/1979. Several revisions led to more characters being added before the 8-bit variant of the character set became officially known asHP Roman-8 in 1983.[1] Soon later, this became the default character set of theHP-UX[2] operating system and the page description languagePCL forinkjet[3] andlaser printers in 1984. The character set was again expanded in 1985.[4] Amodified adaptation of the 1984 definition of Roman-8 was used in theHP Portable series of computers,[5][6] whereas aderivation of the updated 1985 definition of Roman-8 was used in several earlyRPLcalculators and correspondingthermal printers since 1986.[7] The latest off-spring of the family isHP Roman-9, which was introduced in 1999 to include theeuro sign.[8]PCL Ventura International is based on HP Roman-8.
The character set was originally introduced byHewlett-Packard asextended ASCII 7-bit codepage namedHP Roman Extension,[9][10] which existed at least since 1978.[11][12][13][14][15] This character set was used as a secondary character set in conjunction with the primary character set, which was identical to ASCII, except for character 127, which was a medium shaded box instead of the delete character. The first 32 characters, that normally functioned asC0 control codes, also had graphical non-control alternatives, that could appear during self-test or display functions mode. Switching between character sets was done using theShift Out and Shift In characters, or alternatively, on systems supporting 8-bit mode, using the high bit of the character. Before the name "Roman-8" was established for the 8-bit variant in 1983, this was sometimes called "8-bit Roman Extension" or "HP Roman-8 Extension". Over the years both variants were revised to include more characters. The final 1985 revision of the secondary character set was also standardized byIBM in 1989 ascode page 1050 (CP1050 oribm-1050).[16]
Although strictly speaking not part of Roman Extension, the following table shows those rows of the primary character set that differed from ASCII. Note that the first two rows are normally the same and only appear as graphical characters in special circumstances, as described above. Although some of the Unicode control pictures conventionally use three characters rather than two, those "diagonal lettering glyphs are only exemplary; alternate representations may be, and often are used in the visible display of control codes".[17]
The following table shows the 1982 version; a current variant is shown in theRoman-8 section below. The table assumes 8-bit mode is used; if not, subtract 128 (8016) from the character code.
^abIn Unicode, the pound / lira symbol is unified as U+00A3£ and whether this character has a single or double bar is merely considered a typographical variation of the same character. U+20A4 ₤ exists solely for compatibility with this character set, HP Roman-8, in which the singly barred glyph is coded as BB16 and the doubly barred glyph as AF16.[18]
The original 1983/1984 version ofRoman-8 still had some code points undefined.[5][6][3][19] In a 1985 revision code points 177 (Ý), 178 (ý), 242 (·), 243 (µ), 244 (¶) and 245 (¾) were added and the appearance of code point 228 was changed from a stroked d (đ) to an eth (ð).[4][20][21] This final revision of the character set was also standardized as codepage 1051 byIBM in 1989.[22][23]
In contrast to the newerHP Roman-9, HP Roman-8 does not provide a code point for the euro sign.
The following table shows the latest 1985 definition of the HP Roman-8 character set (with some remarks regarding former definitions and alternative interpretations). Each character is shown with a potentialUnicode equivalent and its decimal code, however, sources differ in the recommended translations for some of the codes even among definitions from Hewlett-Packard[2][21] and IBM.[22][23]
^While this character would display as "▒", it was also the character sent by the delete key of HP terminals, so it could be mapped to U+007FDEL instead.[2][25][26][27][28][29][30][31][32]
In 1984, Hewlett-Packard introduced theHP 110 /HP Portable personal computer followed by theHP 110 Plus /HP Portable Plus in 1985. In "HP mode" they supported a derivation of the 1984 revision of 8-bitHP Roman-8 (still lacking the six additional characters at code points 177 to 178 and 242 to 245, and with code point 228 still resembling a stroked d (đ)), but with 32 additional graphical symbols at code points 128 to 159, including a rich set ofbox-drawing characters.[5][6]
In 1986,[66] Hewlett-Packard introduced theHP-18C calculator and HP 82240Athermo printer,[66] which internally used an extended variant of the 1985 revision of the 8-bitHP Roman-8 character set (now with the six additional characters defined and with code point 228 already changed to an eth (ð)), but with the code points 127 (0x7F) and 160 (0xA0) as well as the control codes in the range 128 to 159 (0x80 to 0x9F) being replaced by additional displayable characters,[7][67]some of which were derived from theHP-41C/CV/CX'sFOCAL character set and others incorporated into therevised FOCAL character set used by theHP-42S calculator, although at different code points. On theHP-28 series, characters above 147 (0x93) could not be displayed on the calculator, only be printed.[67][7][68]
^The glyph at code point 148 (⊦) in this variant of HP Roman-8 is called APPEND character in Hewlett-Packard terminology. In theHP-41C/CV/CX andHP-42S series of calculators, it is used to indicate that the following characters will be appended to the alpha register rather than replacing the existing contents of the register. However, these calculators use theFOCAL character set, where the glyph is located at code point 127.
HP Roman-9 (also known asHP Roman 9,hp-roman9,roman9 orR9) is a slight modification of the 8-bitHP Roman-8 character set where thegeneral currency sign (¤) at code point 186 (0xBA) was replaced by theeuro sign (€).[70][71] It was introduced in early 1999.[8] As of 2017, HP Roman-9 still has no known code page number assigned to it.
^abHewlett-Packard LaserJet Printer Family Soft Font Selection Guide(PDF).Hewlett-Packard. November 1986. HP-ID 26026-90924.Archived(PDF) from the original on 17 August 2016. Retrieved17 August 2016.Roman-8 & Roman Extension: The Roman-8 and Roman Extension characters reflect the 1985 updates: 1. Six additional characters added to former blank positions.Ýý·µ¶¾ 2. Appearance of the lowercaseđ character changed toð.
^abcdeHP 82240A Infrared Printer(PDF) (2 ed.). Corvallis, OR, USA:Hewlett Packard, Portable Computer Division. October 1986. HP reorder number 82240-90001 (82240-90008).Archived(PDF) from the original on 6 August 2016. Retrieved6 August 2016.
^abDavis, Steve (1 September 1979)."European 2631A Users Beware!"(PDF).Computer Systems Newsletter - for HP Field Personnel.4 (20).Hewlett-Packard: 4.Archived(PDF) from the original on 17 August 2016. Retrieved17 August 2016.009 - Roman Extension Set ... option 009 gives all the commonly used European characters a unique code. When this "Roman Extension" set is used in conjunction with the standard ASCII character set, the end result is a 256 character set represented by 8-bit code. The use ofshift-in andshift-out characters is not required, since decimal codes 0 through 127 automatically accessUSASCII while codes 128 through 255 access the European characters. It is important to understand the differences between the two techniques and to know which technique is supported on a given system. TheHP 250 andHP 300 support the 8-bit code technique, consequently,2631A option 009 must be ordered to provide local language printing on these two systems. All other HP computer systems and the 264X terminals support the 7-bit code, shift-in/shift-out method.
^Allen, Julie D., ed. (August 2015) [1991].The Unicode Standard - Version 8.0 - Core Specification - Chapter 22.1. Currency Symbols(PDF). Mountain View, CA, USA:Unicode, Inc. pp. 751–752.ISBN978-1-936213-10-8.Archived(PDF) from the original on 6 December 2016. Retrieved6 December 2016.Currency Symbols: U+20A0–U+20CF ... Lira Sign. A separate currency sign U+20A4 LIRA SIGN is encoded for compatibility with theHP Roman-8 character set, which is still widely implemented in printers. In general, U+00A3 POUND SIGN may be used for both the various currencies known as pound (or punt) and the currencies known as lira.
^abcPCL 5 Printer Language Technical Reference Manual (1st ed.).Hewlett-Packard Company. September 1990. pp. A-1, A-6,A-11 –A-18. HP Part No. 33459-90903.
^https://github.com/unicode-org/icu/blame/773345e418e012ed31499def440b89cb7ef5c188/icu4c/data/ibm-1051.ucm This file has remained essentially unchanged since it was imported fromcdctables.zip in 1995; only a few flags have been changed: In July 2000 some characters were marked as mapping only from Unicode, but not back: ` \xA9 # SD130100 → ` \xA9 # SD130100 |1 £ \xAF # SC020001 → £ \xAF # SC020001 |1 ▒ \x7F # SF150000 → ▒ \x7F # SF150000 |1 Fullwidth ! \x21 # SP020000 → Fullwidth ! \x21 # SP020000 |1 Fullwidth " \x22 # SP040000 → Fullwidth " \x22 # SP040000 |1 ... Fullwidth } \x7D # SM140000 → Fullwidth } \x7D # SM140000 |1 Fullwidth ~ \x7E # SD190000 → Fullwidth ~ \x7E # SD190000 |1 Fullwidth ■ \xFC # SM470000 → Fullwidth ■ \xFC # SM470000 |1 These are all cases where multiple characters map to the same byte. In November 2000 some characters were marked as mapping only to Unicode, but not back: - \xF6 |0 → - \xF6 |3 ` \xA9 |1 → ` \xA9 |3 £ \xAF |1 → £ \xAF |3 These are all cases where multiple bytes map to the same character.
^Nungester, Rick (18 August 1988)."Infra-Red output converter". Luc Pauwels (published 24 October 2006).Archived from the original on 6 August 2016. Retrieved6 August 2016.