Cork encoding

From Wikipedia, the free encyclopedia
Latin script character encoding used by LaTeX

The Cork encoding (also known as T1 or EC) is a character encoding used for encoding glyphs in fonts.[1] It is named after the city of Cork in Ireland, where it was introduced for LaTeX at a TeX Users Group (TUG) conference in 1990.[1] It contains 256 characters and supports most Western and Eastern European languages written in the Latin alphabet.[2]

Details


In 8-bit TeX engines the font encoding has to match the encoding of the hyphenation patterns, and the Cork encoding is the one most commonly used for those patterns.[3] In LaTeX one can switch to this encoding with \usepackage[T1]{fontenc}, while in ConTeXt MkII it is already the default encoding. In modern engines such as XeTeX and LuaTeX, Unicode is fully supported and the 8-bit font encodings are obsolete.
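
As a rough illustration of why the two encodings must agree, the sketch below compares how a few byte values read under Cork and under ISO 8859-1 (Latin-1). The Cork values are copied from the character set table in this article. The choice of Latin-1 as the contrasting encoding and the names CORK_SAMPLE and differs_from_latin1 are assumptions made for this example, not part of TeX.

```python
# Cork (T1) characters for a few byte values, copied from the table in this article.
CORK_SAMPLE = {
    0xA3: "\u010d",  # c with caron; Latin-1 has the pound sign at this position
    0xB9: "\u017a",  # z with acute; Latin-1 has superscript one at this position
    0xE9: "\u00e9",  # e with acute; same position as in Latin-1
}


def differs_from_latin1(byte: int) -> bool:
    """Return True if the byte denotes different characters in Cork and Latin-1."""
    latin1_char = bytes([byte]).decode("latin-1")
    return latin1_char != CORK_SAMPLE[byte]


for byte in sorted(CORK_SAMPLE):
    status = "differs from" if differs_from_latin1(byte) else "matches"
    print(f"0x{byte:02X}: Cork {CORK_SAMPLE[byte]!r} {status} Latin-1")
```

A pattern file that stores a letter at byte 0xA3 would therefore only describe the intended letter if the font is read with the same encoding.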

Character set

Cork encoding
Each row lists the characters for low nibbles 0 to F, left to right. A four-digit hexadecimal number after a character is its Unicode code point.
0x  ` 0060  ´ 00B4  ˆ 02C6  ˜ 02DC  ¨ 00A8  ˝ 02DD  ˚ 02DA  ˇ 02C7  ˘ 02D8  ¯ 00AF  ˙ 02D9  ¸ 00B8  ˛ 02DB  ‚ 201A  ‹ 2039  › 203A
1x  “ 201C  ” 201D  „ 201E  « 00AB  » 00BB  en dash 2013  em dash 2014  ZWSP[a] 200B  ₀[b] 2080  ı[c] 0131  ȷ[c] 0237  ﬀ FB00  ﬁ FB01  ﬂ FB02  ﬃ FB03  ﬄ FB04
2x  SP  !  "  #  $  %  &  ’ 2019  (  )  *  +  ,  -  .  /
3x  0  1  2  3  4  5  6  7  8  9  :  ;  <  =  >  ?
4x  @  A  B  C  D  E  F  G  H  I  J  K  L  M  N  O
5x  P  Q  R  S  T  U  V  W  X  Y  Z  [  \  ]  ^  _
6x  ‘ 2018  a  b  c  d  e  f  g  h  i  j  k  l  m  n  o
7x  p  q  r  s  t  u  v  w  x  y  z  {  |  }  ~  SHY[d]
8x  Ă 0102  Ą 0104  Ć 0106  Č 010C  Ď 010E  Ě 011A  Ę 0118  Ğ 011E  Ĺ 0139  Ľ 013D  Ł 0141  Ń 0143  Ň 0147  Ŋ 014A  Ő 0150  Ŕ 0154
9x  Ř 0158  Ś 015A  Š 0160  Ş 015E  Ť 0164  Ţ 0162  Ű 0170  Ů 016E  Ÿ 0178  Ź 0179  Ž 017D  Ż 017B  Ĳ 0132  İ 0130  đ 0111  § 00A7
Ax  ă 0103  ą 0105  ć 0107  č 010D  ď 010F  ě 011B  ę 0119  ğ 011F  ĺ 013A  ľ 013E  ł 0142  ń 0144  ň 0148  ŋ 014B  ő 0151  ŕ 0155
Bx  ř 0159  ś 015B  š 0161  ş 015F  ť 0165  ţ 0163  ű 0171  ů 016F  ÿ 00FF  ź 017A  ž 017E  ż 017C  ĳ 0133  ¡ 00A1  ¿ 00BF  £ 00A3
Cx  À  Á  Â  Ã  Ä  Å  Æ  Ç  È  É  Ê  Ë  Ì  Í  Î  Ï
Dx  Ð[e]  Ñ  Ò  Ó  Ô  Õ  Ö  Œ 0152  Ø  Ù  Ú  Û  Ü  Ý  Þ  SS[f] 1E9E
Ex  à  á  â  ã  ä  å  æ  ç  è  é  ê  ë  ì  í  î  ï
Fx  ð  ñ  ò  ó  ô  õ  ö  œ 0153  ø  ù  ú  û  ü  ý  þ  ß 00DF

Notes

  • The hexadecimal values accompanying the characters in the table are their Unicode code points.
  • The first 12 characters are often used as combining characters.
  a. 0x17 is dubbed a “compound word mark” (CWM) in the Cork encoding, and is an innovation of this standard. It is an invisible character that separates the components of a compound word, for instance in German, in order to disallow esthetic ligatures at compound boundaries.[2] It is mapped to the Unicode “zero-width space” (ZWSP, U+200B), defined at about the same time, whose purpose is similar, if not identical.
  b. 0x18 is a “small o”, used to compose ‰ or ‱ (or arbitrarily smaller quantities) out of the percent sign (%).[2]
  c. Dotless i and dotless j may be used to compose accented variants such as i with macron (ī).
  d. 0x7F is the hyphenation character, not really a soft hyphen (SHY) as defined by Unicode.
  e. 0xD0 is used both as Eth (Ð, U+00D0) and as D with stroke (Đ, U+0110), which can cause problems in some situations, such as copying text from a PDF or hyphenation.
  f. 0xDF contains SS (two letters S). It allows TeX to automatically convert the German lowercase ß into its uppercase form.
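
The table and notes above can be turned into a small lookup, sketched here as a minimal byte-to-Unicode decoder. The code points are copied from the table. The names CORK_TO_UNICODE and decode_cork are hypothetical, and several choices are assumptions of this sketch: 0x7F is mapped to U+00AD because the table labels it SHY (although the notes say it is not really a soft hyphen), 0xD0 is mapped only to Ð (U+00D0), and 0xDF is mapped to U+1E9E as listed.

```python
# Code points for 0x00-0x1F (accents, quotation marks, dashes, ligatures).
_ROWS_0_1 = [
    0x0060, 0x00B4, 0x02C6, 0x02DC, 0x00A8, 0x02DD, 0x02DA, 0x02C7,
    0x02D8, 0x00AF, 0x02D9, 0x00B8, 0x02DB, 0x201A, 0x2039, 0x203A,
    0x201C, 0x201D, 0x201E, 0x00AB, 0x00BB, 0x2013, 0x2014, 0x200B,
    0x2080, 0x0131, 0x0237, 0xFB00, 0xFB01, 0xFB02, 0xFB03, 0xFB04,
]

# Code points for 0x80-0xBF (mostly Central and Eastern European letters).
_ROWS_8_B = [
    0x0102, 0x0104, 0x0106, 0x010C, 0x010E, 0x011A, 0x0118, 0x011E,
    0x0139, 0x013D, 0x0141, 0x0143, 0x0147, 0x014A, 0x0150, 0x0154,
    0x0158, 0x015A, 0x0160, 0x015E, 0x0164, 0x0162, 0x0170, 0x016E,
    0x0178, 0x0179, 0x017D, 0x017B, 0x0132, 0x0130, 0x0111, 0x00A7,
    0x0103, 0x0105, 0x0107, 0x010D, 0x010F, 0x011B, 0x0119, 0x011F,
    0x013A, 0x013E, 0x0142, 0x0144, 0x0148, 0x014B, 0x0151, 0x0155,
    0x0159, 0x015B, 0x0161, 0x015F, 0x0165, 0x0163, 0x0171, 0x016F,
    0x00FF, 0x017A, 0x017E, 0x017C, 0x0133, 0x00A1, 0x00BF, 0x00A3,
]

# Positions in the ASCII and 0xC0-0xFF ranges whose code point is not the byte value.
_OVERRIDES = {
    0x27: 0x2019,  # right single quotation mark
    0x60: 0x2018,  # left single quotation mark
    0x7F: 0x00AD,  # hyphenation character, labelled SHY in the table (assumption)
    0xD7: 0x0152,  # OE ligature, capital
    0xDF: 0x1E9E,  # SS, listed as U+1E9E in the table
    0xF7: 0x0153,  # oe ligature, small
    0xFF: 0x00DF,  # sharp s
}


def _build_table() -> list:
    """Assemble the 256-entry Cork-to-Unicode table described above."""
    table = [chr(cp) for cp in _ROWS_0_1]            # 0x00-0x1F
    table += [chr(b) for b in range(0x20, 0x80)]     # 0x20-0x7F, patched below
    table += [chr(cp) for cp in _ROWS_8_B]           # 0x80-0xBF
    table += [chr(b) for b in range(0xC0, 0x100)]    # 0xC0-0xFF, patched below
    for byte, code_point in _OVERRIDES.items():
        table[byte] = chr(code_point)
    return table


CORK_TO_UNICODE = _build_table()


def decode_cork(data: bytes) -> str:
    """Decode Cork-encoded bytes into a Unicode string, one character per byte."""
    return "".join(CORK_TO_UNICODE[b] for b in data)
```

For example, decode_cork(b"\xa3e\xb2tina") returns "čeština". A round trip back to bytes would need a policy for U+0110, since note e says Đ shares position 0xD0 with Ð; this sketch does not attempt that.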

Supported languages


The encoding supports most European languages written in the Latin alphabet. Notable exceptions are:

Languages with slightly suboptimal support include:

References

  1. Petrlik, Lukas (1996-06-19). "The Czech and Slovak Character Encoding Mess Explained". cs-encodings-faq. 1.10. Archived from the original on 2016-06-21. Retrieved 2016-06-21.
  2. Ferguson, Michael (1990). "Report on Multilingual Activities" (PDF). TUGboat. 11 (4): 514–516.
  3. TeX hyphenation patterns
