Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Unicode subscripts and superscripts

Page semi-protected
From Wikipedia, the free encyclopedia
(Redirected from)
Unicode denominator & numerator glyphs

This article containsspecial characters. Without properrendering support, you may seequestion marks, boxes, or other symbols.

Unicode hassubscripted and superscripted versions of a number of characters including a full set ofArabic numerals.[1] These characters allow anypolynomial,chemical and certain otherequations to be represented in plain text without using any form ofmarkup likeHTML orTeX.

TheWorld Wide Web Consortium and theUnicode Consortium have made recommendations on the choice between using markup and using superscript and subscript characters:

When used in mathematical context (MathML) it is recommended to consistently use style markup for superscripts and subscripts […] However, when super and sub-scripts are to reflect semantic distinctions, it is easier to work with these meanings encoded in text rather than markup, for example, inphonetic orphonemic transcription.[2]

Uses

The difference between superscript/subscript and numerator/denominator glyphs. In many popularcomputer fonts the Unicode "superscript" and "subscript" characters are actually numerator and denominator glyphs.

Theintended use[2] when these characters were added to Unicode was to produce true superscripts and subscripts so that chemical and algebraic formulas could be written without markup. Thus "H₂O" (using a subscript 2 character) issupposed to be identical to "H2O" (with subscript markup).

In reality, many fonts that include these characters ignore the Unicode definition, and instead design the digits for mathematicalnumerator anddenominator glyphs,[3][4] which are aligned with thecap line and thebaseline, respectively. When used with thesolidus or the Fraction Slash, they produce an almost typographically correct diagonal fraction, such as ³/₄ for the ¾ glyph. Super and subscript markup does not produce a correct fraction (compare markup3/4 with precomposed ¾). The change also makes the superscript letters useful forordinal indicators, more closely matching the ª and º characters.

Unicode intended that diagonal fractions be rendered by a different mechanism: thefraction slash U+2044 is visually similar to the solidus, but when used with the ordinary digits (not the superscripts and subscripts), it instructs the layout system that a fraction such as ¾ is to be rendered using automatic glyph substitution.[5][a] User-end support was quite poor for a number of years, but fonts,[b] browsers,[c] word processors,[d] desktop publishing software[e] and others increasingly support the intended Unicode behavior. This browser and your default font render it as 3⁄4. (SeeSlash (punctuation)#Fractions for rendering in various other fonts.)

Superscripts and subscripts block

Main article:Superscripts and Subscripts (Unicode block)

The most common superscript digits (1, 2, and 3) were included inISO-8859-1 and were therefore carried over into those code points in the Latin-1 range of Unicode. The remainder were placed along with basic arithmetical symbols, and later some Latin subscripts, in a dedicated block atU+2070 to U+209F. The table below shows these characters together. Each superscript or subscript character is preceded by a baselinex to show the height of subscripting/superscripting.

Six code points in the "Superscripts and Subscripts" block are unassigned, and remain available for future characters. As of November 2024,[ref] three of these (209D, 209E, and 209F) were provisionally assigned to new subscript characters, namely Latin lowercase w, y, and z.[6][7]

Unicode characters
0123456789ABCDEF
U+00Bx
U+207xx⁰xⁱx⁴x⁵x⁶x⁷x⁸x⁹x⁺x⁻x⁼x⁽x⁾xⁿ
U+208xx₀x₁x₂x₃x₄x₅x₆x₇x₈x₉x₊x₋x₌x₍x₎
U+209xxₐxₑxₒxₓxₔxₕxₖxₗxₘxₙxₚxₛxₜ
  Not yet assigned.
  Other characters from Latin-1 not related to super- or sub-scripts.

Other superscript and subscript characters

Unicode also includes codepoints for subscript and superscript characters that are intended for semantic usage, in the following blocks:[1][8]

Superscript
  • TheLatin-1 Supplement block contains the feminine and masculineordinal indicators ª and º.
  • TheLatin Extended-C block contains one superscript, ⱽ.
  • TheLatin Extended-D block contains six superscripts: ꝰ ꟲ ꟳ ꟴ ꟸ ꟹ.
  • TheLatin Extended-E block contains five superscripts: ꭜ ꭝ ꭞ ꭟ ꭩ.
  • TheLatin Extended-F block is entirelysuperscript IPA letters: 𐞁 𐞂 𐞃 𐞄 𐞅 𐞇 𐞈 𐞉 𐞊 𐞋 𐞌 𐞍 𐞎 𐞏 𐞐 𐞑 𐞒 𐞓 𐞔 𐞕 𐞖 𐞗 𐞘 𐞙 𐞚 𐞛 𐞜 𐞝 𐞞 𐞟 𐞠 𐞡 𐞢 𐞣 𐞤 𐞥 𐞦 𐞧 𐞨 𐞩 𐞪 𐞫 𐞬 𐞭 𐞮 𐞯 𐞰 𐞲 𐞳 𐞴 𐞵 𐞶 𐞷 𐞸 𐞹 𐞺.
  • TheSpacing Modifier Letters block has superscripted letters and symbols used for phonetic transcription: ʰ ʱ ʲ ʳ ʴ ʵ ʶ ʷ ʸ ˀ ˁ ˠ ˡ ˢ ˣ ˤ.
  • ThePhonetic Extensions block has several superscripted letters and symbols: Latin/IPA ᴬ ᴭ ᴮ ᴯ ᴰ ᴱ ᴲ ᴳ ᴴ ᴵ ᴶ ᴷ ᴸ ᴹ ᴺ ᴻ ᴼ ᴽ ᴾ ᴿ ᵀ ᵁ ᵂ ᵃ ᵄ ᵅ ᵆ ᵇ ᵈ ᵉ ᵊ ᵋ ᵌ ᵍ ᵏ ᵐ ᵑ ᵒ ᵓ ᵖ ᵗ ᵘ ᵚ ᵛ, Greek ᵝ ᵞ ᵟ ᵠ ᵡ, Cyrillic ᵸ, other ᵎ ᵔ ᵕ ᵙ ᵜ. These are intended to indicatesecondary articulation.
  • ThePhonetic Extensions Supplement block has several more: Latin/IPA ᶛ ᶜ ᶝ ᶞ ᶟ ᶠ ᶡ ᶢ ᶣ ᶤ ᶥ ᶦ ᶧ ᶨ ᶩ ᶪ ᶫ ᶬ ᶭ ᶮ ᶯ ᶰ ᶱ ᶲ ᶳ ᶴ ᶵ ᶶ ᶷ ᶸ ᶹ ᶺ ᶻ ᶼ ᶽ ᶾ, Greek ᶿ.
  • TheCyrillic Extended-B block contains twoCyrillic superscripts: ꚜ ꚝ.
  • TheCyrillic Extended-D block contains many Cyrillic superscripts: 𞀰 𞀱 𞀲 𞀳 𞀷 𞀵 𞀶 𞀷 𞀸 𞀹 𞀺 𞀻 𞀼 𞀽 𞀾 𞀿 𞁀 𞁁 𞁂 𞁃 𞁄 𞁅 𞁆 𞁇 𞁈 𞁉 𞁊 𞁋 𞁌 𞁍 𞁎 𞁏 𞁐 𞁫 𞁬 𞁭.
  • TheGeorgian block contains one superscriptedMkhedruli letter: ჼ.
  • TheKanbun block has superscripted annotation characters used in Japanese copies ofClassical Chinese texts: ㆒ ㆓ ㆔ ㆕ ㆖ ㆗ ㆘ ㆙ ㆚ ㆛ ㆜ ㆝ ㆞ ㆟.
  • TheTifinagh block has one superscript letter : ⵯ.
  • TheUnified Canadian Aboriginal Syllabics and itsExtended blocks contain several mostlyconsonant-only letters to indicatesyllable coda called Finals, along with some characters that indicatesyllable medial known as Medials: Main block ᐜ ᐝ ᐞ ᐟ ᐠ ᐡ ᐢ ᐣ ᐤ ᐥ ᐦ ᐧ ᐨ ᐩ ᐪ ᑉ ᑊ ᑋ ᒃ ᒄ ᒡ ᒢ ᒻ ᒼ ᒽ ᒾ ᓐ ᓑ ᓒ ᓪ ᓫ ᔅ ᔆ ᔇ ᔈ ᔉ ᔊ ᔋ ᔥ ᔾ ᔿ ᕀ ᕁ ᕐ ᕑ ᕝ ᕪ ᕻ ᕯ ᕽ ᖅ ᖕ ᖖ ᖟ ᖦ ᖮ ᗮ ᘁ ᙆ ᙇ ᙚ ᙾ ᙿ; Extended block: ᣔ ᣕ ᣖ ᣗ ᣘ ᣙ ᣚ ᣛ ᣜ ᣝ ᣞ ᣟ ᣳ ᣴ ᣵ.
Combining superscript
  • TheCombining Diacritical Marks block contains medieval superscript letter diacritics. These letters are written directly above other letters appearing in medieval Germanic manuscripts, and so these glyphs do not include spacing, for example uͤ. They are shown here over thedotted circle placeholder ◌: ◌ͣ ◌ͤ ◌ͥ ◌ͦ ◌ͧ ◌ͨ ◌ͩ ◌ͪ ◌ͫ ◌ͬ ◌ͭ ◌ͮ ◌ͯ.
  • TheCombining Diacritical Marks Extended block contains three combininginsular letters for the Middle EnglishOrmulum, ◌ᫌ ◌ᫍ ◌ᫎ.[9]
  • TheCombining Diacritical Marks Supplement block contains additional medieval superscript letter diacritics, enough to complete the basic lowercase Latin alphabet except for j, q and y, a few small capitals and ligatures (ae, ao, av), and additional letters: ◌᷒ ◌ᷓ ◌ᷔ ◌ᷕ ◌ᷖ ◌ᷗ ◌ᷘ ◌ᷙ ◌ᷚ ◌ᷛ ◌ᷜ ◌ᷝ ◌ᷞ ◌ᷟ ◌ᷠ ◌ᷡ ◌ᷢ ◌ᷣ ◌ᷤ ◌ᷥ ◌ᷦ ◌ᷧ ◌ᷨ ◌ᷪ ◌ᷫ ◌ᷬ ◌ᷭ ◌ᷮ ◌ᷯ ◌ᷰ ◌ᷱ ◌ᷲ ◌ᷳ ◌ᷴ, Greek ◌ᷩ.
  • TheCyrillic Extended-A and -B blocks contains multiple medieval superscript letter diacritics, enough to complete the basic lowercase Cyrillic alphabet used in Church Slavonic texts, also includes an additional ligature (ст): ◌ⷠ ◌ⷡ ◌ⷢ ◌ⷣ ◌ⷤ ◌ⷥ ◌ⷦ ◌ⷧ ◌ⷨ ◌ⷩ ◌ⷪ ◌ⷫ ◌ⷬ ◌ⷭ ◌ⷮ ◌ⷯ ◌ⷰ ◌ⷱ ◌ⷲ ◌ⷳ ◌ⷴ ◌ⷵ ◌ⷶ ◌ⷷ ◌ⷸ ◌ⷹ ◌ⷺ ◌ⷻ ◌ⷼ ◌ⷽ ◌ⷾ ◌ⷿ ◌ꙴ ◌ꙵ ◌ꙶ ◌ꙷ ◌ꙸ ◌ꙹ ◌ꙺ ◌ꙻ ◌ꚞ ◌ꚟ.
  • TheCyrillic Extended-D block has one additional combining character, that being і: ◌𞂏.
Subscript
  • TheLatin Extended-C block contains one subscript, ⱼ.
  • ThePhonetic Extensions block has several subscripted letters and symbols: Latin/IPA ᵢ ᵣ ᵤ ᵥ and Greek ᵦ ᵧ ᵨ ᵩ ᵪ.
  • TheCyrillic Extended-D block also contains many Cyrillic subscripts: 𞁑 𞁒 𞁓 𞁔 𞁕 𞁖 𞁗 𞁘 𞁙 𞁚 𞁛 𞁜 𞁝 𞁞 𞁟 𞁠 𞁡 𞁢 𞁣 𞁤 𞁥 𞁦 𞁧 𞁨 𞁩 𞁪.
Combining subscript

Latin, Greek, Cyrillic, and IPA tables

A superscript small-capW may be distinct from a superscript lowercasew in italic typeface, as in this phonetic notation.

Consolidated, the Unicode standard contains superscript and subscript versions of a subset of Latin, Greek and Cyrillic letters. Here they are arranged in alphabetical order for comparison (or for copy and paste convenience). Since these characters appear in different Unicode ranges, they may not appear to be the same size or position due to font substitution by the browser. Shaded cells mark petite capitals that are not very distinct from minuscules in roman typeface, but they may be distinct in italic typeface, as is used in some phonetic notation.

Little punctuation is encoded. Parentheses are shown in the basic superscript block above, and the exclamation mark ⟨⟩ is shown in the IPA table below. In a supporting font, a question mark may be created with a superscript gelded question mark and a combining dot below: ⟨ˀ̣⟩.

Latin superscript and subscript letters
ABCDEFGHIJKLMNOPQRSTUVWXYZ
Superscript capitalᴿ*
Superscript small capital*𐞄**𐞒𐞖*𐞪𐞲
Superscript minusculeʰʲˡ𐞥ʳˢʷˣʸ
Overscript small capital◌ᷛ◌ᷞ◌ᷟ◌ᷡ◌ᷢ
Overscript minuscule◌ͣ◌ᷨ◌ͨ◌ͩ◌ͤ◌ᷫ◌ᷚ◌ͪ◌ͥ◌ᷜ◌ᷝ◌ͫ◌ᷠ◌ͦ◌ᷮ◌ͬ◌ᷤ◌ͭ◌ͧ◌ͮ◌ᷱ◌ͯ◌ᷦ
Subscript minuscule***
Underscript minuscule◌᷊◌ᪿ

*Superscript versions of S, of petite capital A, D, E and P, of ƀ, and subscript versions of w, y and z have been accepted for a future version of the Unicode Standard.[6]

Additional Latin characters
ÆɃƎƏŊ
Superscript capital
Superscript minuscule𐞃*
Overscript minuscule◌ᷔ◌ᷪ
Subscript minuscule

Some of these superscript capitals are small caps in the source documents in the Unicode proposals.

Shaded cells Greek letters that are indistinguishable from Latin, and so would not be expected to be supported by Unicode.

Greek superscript and subscript letters
ΑΒΓΔΕΖΗΘΙΚΛΜΝΞΟΠΡΣΤΥΦΧΨΩ
Superscript minuscule[A]ᶿ[A]**
Overscript minuscule[A]◌ᷩ
Subscript minusculeͺ[f]
Underscript minuscule◌ͅ◌̫[g]
  1. ^abcIn some fonts, Latin alpha ᵅ and upsilon ᶹ can be used as superscript Greek alpha and upsilon. ᵋ and ᶥ are also officially Latin letters, but display the same as Greek.

*Superscript versions of Greek psi and omega have been accepted for a future version of the Unicode Standard.[6]

Russian superscript and subscript letters
АБВГДЕЖЗИКЛМНОПРСТУФХЦЧШЩЪЫЬЭЮЯ
Superscript𞀰𞀱𞀲𞀳𞀴𞀵𞀶𞀷𞀸𞀹𞀺𞀻𞀼𞀽𞀾𞀿𞁀𞁁𞁂𞁃𞁄𞁅𞁆𞁇𞁈𞁉
Overscript◌ⷶ◌ⷠ◌ⷡ◌ⷢ◌ⷣ◌ⷷ◌ⷤ◌ⷥ◌ꙵ◌ⷦ◌ⷧ◌ⷨ◌ⷩ◌ⷪ◌ⷫ◌ⷬ◌ⷭ◌ⷮ◌ꙷ◌ꚞ◌ⷯ◌ⷰ◌ⷱ◌ⷲ◌ⷳ◌ꙸ◌ꙹ◌ꙺ◌ⷻ
Subscript𞁑𞁒𞁓𞁔𞁕𞁖𞁗𞁘𞁙𞁚𞁛𞁜𞁝𞁞𞁟𞁠𞁡𞁢𞁣𞁤𞁥𞁦
Additional modern Cyrillic characters
ӘҐЄЅІЇЈӨҪҮҰЏӀ
Superscript𞁋𞁊𞁌𞁌̈𞁍𞁎𞁫𞁏𞁭𞁐
Overscript◌ꙴ◌𞂏◌ꙶ
Subscript𞁧𞁩𞁨𞁨̈𞁪
Additional medieval Cyrillic characters
ѠѢѤѦѪѬѲ
Overscript◌ⷹ◌ꙻ◌ⷺ◌ⷼ◌ꚟ◌ⷽ◌ⷾ◌ⷿ◌ⷴ
Superscript𞁬

Superscript and subscriptё,ї,й,ў etc. are handled with diacritics,⟨𞀵̈ 𞁌̈ 𞀸̆ 𞁁̆⟩etc. Many of the Cyrillic characters were added to theCyrillic Extended-D block, which was added to the freeGentium Plus andAndika fonts with version 6.2 in February 2023.

See alsosmall caps in Unicode.

Superscript IPA

TheLatin Extended-F block was created for the remainingsuperscript IPA letters. They are supported by the freeGentium Plus andAndika fonts. Additional superscript characters for historical and para-IPA letters have been accepted for future versions of the Unicode Standard.[6]

Consonant letters

The Unicode characters for superscript (modifier) IPA andextIPA consonant letters are as follows. The entireLatin Extended-F block is dedicated to superscript IPA. Characters for sounds with secondary articulation are set off in parentheses and placed below the base letters.

IPA and extIPA consonants, along with superscript variants and their Unicode code points
Bi­labialLabio­dentalDentalAlveolarPost­alveolarRetro­flexPalatalVelarUvularPharyn­gealGlottal
Nasalm ᵐ
1D50
ɱ ᶬ
1DAC
n ⁿ
207F
()
 
 
(ȵ)
ɳ ᶯ
1DAF
ɲ ᶮ
1DAE
ŋ ᵑ
1D51
ɴ ᶰ
1DB0
Plosivep ᵖ
1D56
b ᵇ
1D47
t ᵗ
1D57
(ƫ ᶵ)
1DB5
d ᵈ
1D48
()
 
 
(ȶ)
 
 
(ȡ)
ʈ 𐞯
107AF
ɖ 𐞋
1078B
c ᶜ
1D9C
ɟ ᶡ
1DA1
k ᵏ
1D4F
ɡ ᶢ/g ᵍ
1DA2/1D4D
q 𐞥
107A5
ɢ 𐞒
10792
ʡ 𐞳
107B3
ʔ ˀ
02C0
Affricateʦ 𐞬
107AC
ʣ 𐞇
10787
ʧ 𐞮
107AE
(ʨ𐞫)
107AB
ʤ 𐞊
1078A
(ʥ𐞉)
10789
ꭧ 𐞭
107AD
(𝼜)
ꭦ 𐞈
10788
(𝼙)
Fricativeɸ ᶲ
1DB2
β ᵝ
1D5D
f ᶠ
1DA0
v ᵛ
1D5B
θ ᶿ
1DBF
ð ᶞ
1D9E
s ˢ
02E2
()
z ᶻ
1DBB
()
ʃ ᶴ
1DB4
(ɕ ᶝ)
1D9D
ʒ ᶾ
1DBE
(ʑ ᶽ)
1DBD
ʂ ᶳ
1DB3
()
ʐ ᶼ
1DBC
()
ç ᶜ̧
1D9C + 0327[h]
ʝ ᶨ
1DA8
x ˣ
02E3
(ɧ 𐞗)
10797
ɣ ˠ
02E0
χ ᵡ
1D61
ʁ ʶ
02B6
ħ 𐞕
10795
(ʩ 𐞐)
10790
ʕ ˤ
02E4[i]
h ʰ
02B0
()
ɦ ʱ
02B1
Approximantʋ ᶹ
1DB9
ɹ ʴ
02B4
ɻ ʵ
02B5
j ʲ
02B2
(ɥ ᶣ)
1DA3
 
 
(ʍ ꭩ)
AB69
ɰ ᶭ
1DAD
(w ʷ)
02B7
Tap/flapⱱ 𐞰
107B0
ɾ 𐞩
107A9
ɽ 𐞨
107A8
Trillʙ 𐞄
10784
r ʳ
02B3
ʀ 𐞪
107AA
ʜ 𐞖
10796
ʢ 𐞴
107B4
Lateral fricativeɬ 𐞛
1079B
(ʪ 𐞙)
10799
ɮ 𐞞
1079E
(ʫ 𐞚)
1079A
ꞎ 𐞝
1079D
𝼅 𐞟
1079F
𝼆 𐞡
107A1
𝼄 𐞜
1079C
Lateral approximantl ˡ
02E1
(ᶅ ᶪ)
1DAA
 
 
(ȴ)
ɭ ᶩ
1DA9
ʎ 𐞠
107A0
ʟ ᶫ
1DAB
(ɫ ꭞ)[j]
AB5E
Lateral tap/flapɺ 𐞦
107A6
𝼈 𐞧
107A7
Implosiveƥɓ 𐞅
10785
ƭɗ 𐞌
1078C
𝼉ᶑ 𐞍
1078D
ƈʄ 𐞘
10798
ƙɠ 𐞓
10793
ʠʛ 𐞔
10794
Click releaseʘ 𐞵
107B5
ǀ 𐞶
107B6
ʇǃ ꜝ
A71D
ʗ𝼊 𐞹
107B9
ψǂ 𐞸
107B8
𝼋(ʞ)
Lateral click
release
ǁ 𐞷
107B7
ʖ
Percussive¡ ꜞ
A71E[k]

The spacing diacritic for ejective consonants, U+2BC, works with superscript letters despite not being superscript itself: ⟨ᵖʼᵗʼᶜʼᵏˣʼ⟩. If a distinction needs to be made, the combining apostrophe U+315 may be used: ⟨ᵖ̕ᵗ̕ᶜ̕ᵏˣ̕⟩. The spacing diacritic should be used for a baseline letter with a superscript release, such as[tˢʼ] or[kˣʼ], where the scope of the apostrophe includes the non-superscript letter, but the combining apostrophe U+315 might be used to indicate a weakly articulated ejective consonant like[ᵗ̕] or[ᵏ̕], where the whole consonant is written as a superscript, or together with U+2BC when separate apostrophes have scope over the base and modifier letters, as in ⟨pʼᵏˣ̕⟩.[10]

Spacing diacritics, as in ⟨⟩, cannot be secondarily superscripted in plain text: ⟨ᵗʲ⟩. (In this instance, the old IPA letter for[tʲ], ⟨ƫ⟩, has a superscript variant in Unicode, U+1DB5 ⟨⟩, but that is not generally the case.)

Among older letters, the most common letters withpalatal hook are supported; they are displayed in the table above. IPA once had an idiosyncratic curl on some of the palatalized letters: these are the fricative letters ⟨ʆʓ⟩. Their superscript forms have been accepted for a future version of the Unicode Standard. Old-style click letters and the retired letters ⟨ƞ⟩ and ⟨ɼ⟩ have also been accepted for a future version of the Unicode Standard.[6] The Teuthonista letter ⟨⟩ (U+A727) approximates an old graphic variant of ⟨ɮ⟩. Its superscript is supported at ⟨⟩ (U+AB5C).

Among para-IPA letters, superscript variants of Sinological ⟨ȡȴȵȶ⟩, of the Bantuist labio-dental plosives ⟨ȹ⟩ and ⟨ȸ⟩, and of central semivowels ⟨ɉ⟩, ⟨ɥ̶⟩, and ⟨⟩ have been accepted for a future version of the Unicode Standard.[6]

Vowel letters

The Unicode characters for superscript (modifier) IPA vowel letters, plus a pair of extended letters ⟨ᵿ⟩ found in English dictionaries, are as follows. Recently retired alternative letters such as ⟨ɩɷ⟩ are also supported; they are set off in parentheses and placed below the standard IPA letters:

IPA vowels and superscript variants
FrontCentralBack
Closei ⁱ
2071
y ʸ
02B8
ɨ ᶤ
1DA4
ʉ ᶶ
1DB6
ɯ ᵚ
1D5A
u ᵘ
1D58
Near-closeɪ ᶦ
1DA6
(ɩ ᶥ)
1DA5
ʏ 𐞲
107B2




(ᵻ ᶧ)
1DA7


(ᵿ)



(ω)

ʊ ᶷ
1DB7
(ɷ 𐞤)
107A4
Close-mide ᵉ
1D49
ø 𐞢
107A2
ɘ 𐞎
1078E
ɵ ᶱ
1DB1
ɤ 𐞑
10791
o ᵒ
1D52
Midə ᵊ
1D4A
Open-midɛ ᵋ
1D4B
œ ꟹ
A7F9
ɜ ᶟ
1D9F
(ᴈ ᵌ)
1D4C
ɞ 𐞏
1078F
ʌ ᶺ
1DBA
ɔ ᵓ
1D53
Near-openæ 𐞃
10783
ɶ 𐞣
107A3
ɐ ᵄ
1D44
ɑ ᵅ
1D45
ɒ ᶛ
1D9B
Opena ᵃ
1D43

The precomposed Unicode rhotic vowel letters ⟨ɚɝ⟩ are not directly supported. The rhotic diacritic U+02DE◌˞ should be used instead: ⟨ᵊ˞ ᶟ˞⟩.[11]

ɜ⟩ and ⟨⟩ arereversedɛ. The older IPAturnedɛ, ⟨⟩, is also supported, at U+1D4C ⟨⟩. However, the briefly resurrected vowel letter ⟨ʚ⟩ (U+029A) is not supported, only its reversed replacement ⟨ɞ⟩ is.

Among older letters, ⟨⟩ (U+1D1C), a graphic variant of ⟨ʊ⟩, is supported at ⟨⟩ (U+1DB8).[12]

Among para-IPA letters, Sinological superscript ⟨ɿʅʮʯ ⟩ have been accepted for a future version of the Unicode Standard.[6]

Length marks

The two length marks are also supported:

Length marks
LongHalf-long
ː 𐞁
10781
ˑ 𐞂
10782

These are used to add length to another superscript, such as ⟨Cʰ𐞁⟩ or ⟨Cʰ𐞂⟩ for long aspiration.

Wildcards

Superscript wildcards (full caps) are largely supported: e.g.ᴺC (prenasalized consonant),ꟲN (prestopped nasal),Pꟳ (fricative release),NᴾF (epenthetic plosive),CVNᵀ (tone-bearing syllable),Cᴸ (liquid or lateral release),Cᴿ (rhotic or resonant release),Vᴳ (off-glide/diphthong),Cⱽ (fleeting vowel). SuperscriptS for sibilant release has been accepted for a future version of the Unicode Standard;[6] superscript for fleeting/epenthetic click has not. Other basic Latin superscript wildcards for tone and weak indeterminate sounds, as described in the article on theInternational Phonetic Alphabet, are mostly supported. (See table in previous section.)

Combining marks and subscripts

In addition, a very few IPA letters beyond the basic Latin alphabet have combining forms or are supported as subscripts:

Additional IPA characters
äɑæβçðəʃʍχʔʼ
Overscript◌ᷲ◌ᷧ◌ᷔ◌ᷩ◌ᷗ◌ᷙ◌ᷪ◌ᷯ◌̉[l]◌̓
Subscript
Underscript◌ᫀ◌̦

Composite characters

Primarily for compatibility with earlier character sets, Unicode contains a number of characters that compose super- and subscripts with other symbols.[1] In most fonts these render much better than attempts to construct these symbols from the above characters or by using markup.

Notes

  1. ^For a general overview and technical information on glyph substitution (though not specifically for fractions), seeGSUB — Glyph Substitution Table in theOpenType specification on theMicrosoft Typography site.
  2. ^Such asAndika,Arno Pro,Brill,Brioso Pro,Calibri,Candara,Carlito,Cantarell,FiraGO,EB Garamond,Gentium Book,Lato,Linux Libertine,Noto Sans,Noto Serif,Open Sans andYrsa
  3. ^Such asChrome,Firefox andFalkon
  4. ^Such asLibreOffice Writer
  5. ^Such asAdobe InDesign andScribus
  6. ^ͺ⟩ is set lower than a normal subscript. It is equivalent to underscript ⟨◌ͅ⟩ on a space.
  7. ^◌̫⟩ is traditionally typeset as an omega.
  8. ^Superscript ⟨ç⟩ is composed of superscriptc and acombiningcedilla, which should display properly in a good font. Superscript c was specifically requested for this purpose in Unicode proposal L2/03-180.
  9. ^U+02E4 ˤMODIFIER LETTER SMALL REVERSED GLOTTAL STOP is the superscript variant ofU+0295 ʕLATIN LETTER PHARYNGEAL VOICED FRICATIVE and is defined for IPA use. The similar characterU+02C1 ˁMODIFIER LETTER REVERSED GLOTTAL STOP is a reversedU+02C0 ˀMODIFIER LETTER GLOTTAL STOP, perhaps a gelded reversed question mark. Fonts are inconsistent in whether they look different and what the difference is.
  10. ^In Microsoft fonts, superscript ⟨ɫ⟩ was erroneously designed as a superscript ⟨⟩.
  11. ^U+A71D ⟨⟩ and A71E ⟨⟩ were adopted as the Africanist equivalents of the IPA characters ⟨downstep and ⟨upstep. The correspondence of U+A71D ⟨⟩ to the IPA click letter ⟨ǃ⟩ is thus accidental. Coincidentally, U+A71E ⟨⟩ serves as the superscript variant of the extIPApercussive consonant¡⟩; the other percussive letters, ⟨ʬ⟩ and ⟨ʭ⟩, do not have superscript support in Unicode.
  12. ^This is actually the Vietnamese diacriticdấu hỏi, not specifically IPA, but graphically both are gelded question marks.

References

  1. ^abc"UCD: UnicodeData.txt".The Unicode Standard. RetrievedMay 14, 2016.
  2. ^abMartin Dürst, Asmus Freytag (May 16, 2007)."Unicode in XML and other Markup Languages". W3C. RetrievedSeptember 13, 2010.
  3. ^"fraction | Dart Package".Dart packages. December 27, 2021. RetrievedSeptember 21, 2022.
  4. ^"MathML | General layout elements | Fractions".data2type GmbH (in German). March 30, 2021. Archived fromthe original on January 28, 2021. RetrievedJanuary 13, 2022.
  5. ^Martin Dürst, Asmus Freytag (May 16, 2007)."Fraction Slash". W3C. RetrievedSeptember 13, 2010.
  6. ^abcdefgh"Additional draft repertoire for provisionally assigned code points for Unicode"(PDF).Unicode Consortium. November 26, 2024.
  7. ^"Approved Minutes of UTC Meeting 181".Unicode Consortium. January 27, 2025. RetrievedMarch 8, 2025.
  8. ^"UCD: Scripts.txt".The Unicode Standard. RetrievedSeptember 21, 2022.
  9. ^Everson, Michael; West, Andrew (October 5, 2020)."L2/20-268: Revised proposal to add ten characters for Middle English to the UCS"(PDF).
  10. ^Kirk Miller & Michael Ashby,L2/20-253R Unicode request for IPA modifier letters (b), non-pulmonic.
  11. ^Kirk Miller & Michael Ashby,L2/20-252R Unicode request for IPA modifier-letters (a), pulmonic
  12. ^Kirk Miller (January 30, 2024)."L2/24-081: Latin Phonetic The for Middle Tilde"(PDF).
  13. ^Silva, Eduardo Marín (March 1, 2017)."L2/17-066R: Proposal to encode the Marca Registrada sign"(PDF).
Unicode
Code points
Characters
Special purpose
Lists
Processing
Algorithms
Comparison of encodings
On pairs of
code points
Usage
Related standards
Related topics
Scripts and symbols in Unicode
Common and
inherited scripts
Modern scripts
Ancient and
historic scripts
Notational scripts
Symbols, emojis
Retrieved from "https://en.wikipedia.org/w/index.php?title=Unicode_subscripts_and_superscripts&oldid=1282541163"
Category:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp