
Estonian orthography is the system used for writing theEstonian language and is based on theLatin alphabet. The Estonianorthography is generally guided by phonemic principles, with eachgrapheme corresponding to onephoneme.
Due toGerman andSwedish influence, the Estonian alphabet (Estonian:eesti tähestik) has the lettersÄ,Ö, andÜ (A, O, and U withdiaeresis), which represent the vowel sounds[æ],[ø] and[y], respectively. Unlike Germanumlauts, they are considered, and alphabetised as, separate letters. The most distinctive letter in the Estonian alphabet, however, is theÕ (O with tilde), which was added to the alphabet in the 19th century byOtto Wilhelm Masing and stands for the vowel[ɤ]. In addition, the alphabet also differs from the Latin alphabet by the addition of the lettersŠ andŽ (S and Z withcaron/háček), and by the position of Z in the alphabet: it has been moved from the end to between S and T (or Š and Ž).
The official Estonian alphabet has 27 letters: A, B, D, E, F, G, H, I, J, K, L, M, N, O, P, R, S, Š, Z, Ž, T, U, V, Õ, Ä, Ö, Ü. The letters F, Š, Z, Ž are so-called "foreign letters" (võõrtähed), and occur only inloanwords andproper names.
Additionally,C,Q,W,X, andY are "foreign letters" used only in writing foreignproper names. These letters do not occur in anyEstonian words, and thus are not usually considered part of the "Estonian proper" alphabet. Including all the foreign letters, the entire alphabet consists of the following 32 letters:
| Letter | IPA | Name | Notes | Letter | IPA | Name | Notes | ||
|---|---|---|---|---|---|---|---|---|---|
| A | a | [ɑ] | aa[ɑːː] | Q | q | — | kuu[kuːː] | [a] | |
| B | b | [b] | bee[beːː] | R | r | [r] | err[erːː] orärr[ærːː] | ||
| C | c | — | tsee[tseːː] | [a] | S | s | [s] | ess[esːː] | |
| D | d | [d] | dee[deːː] | Š | š | [ʃ] or[ʃː] | šaa[ʃɑːː] | [b] | |
| E | e | [e] | ee[eːː] | Z | z | [s] | zett[setːː] | [b] | |
| F | f | [f] or[fː] | eff[efːː] | [b] | Ž | ž | [ʒ] | žee[ʒeːː] | [b] |
| G | g | [ɡ] | gee[ɡeːː] | T | t | [t] or[tː] | tee[teːː] | ||
| H | h | [h] | haa[hɑːː] orašš[ɑʃːː] | U | u | [u] | uu[uːː] | ||
| I | i | [i] | ii[iːː] | V | v | [v] | vee[veːː] | ||
| J | j | [j] | jott[jotʲːː] | W | w | — | kaksisvee[kɑk.sisˈveːː] | [a] | |
| K | k | [k] or[kː] | kaa[kɑːː] | Õ | õ | [ɤ] | õõ[ɤːː] | ||
| L | l | [l] | ell[elːː] | Ä | ä | [æ] | ää[æːː] | ||
| M | m | [m] | emm[emːː] | Ö | ö | [ø] | öö[øːː] | ||
| N | n | [n] | enn[enːː] | Ü | ü | [y] | üü[yːː] | ||
| O | o | [o] | oo[oːː] | X | x | — | iks[iksː] | [a] | |
| P | p | [p] or[pː] | pee[peːː] | Y | y | — | igrek[ˈiɡ.rek] orüpsilon[ˈyp.si.lon] | [a] | |
InBlackletter script W was used instead of V. In some reference works in the past, V and W were sorted as if they were one and the same letter.
Johannes Aavik suggested that the letter Ü be replaced by Y, as it has been in theFinnish alphabet.
Double letters are used to write half-long and overlong vowels and consonants, e. g.aa[ɑː] or[ɑːː],nn[nː] or[nːː],kk[kːː]. For more information, see below.
As the distinction between voiced and voiceless plosives is not native to Estonian, the names of the letters 'b', 'd', 'g' may be pronounced[peːː],[teːː],[keːː], so the letters 'b' and 'd' are also namednõrk B ('weak B') andnõrk D ('weak D') to distinguish them fromtugev P ('strong P') andtugev T ('strong T'). About usage of these letters, see below.
Although Estonianorthography is generally guided by phonemic principles, with eachgrapheme corresponding to onephoneme, there are some historical and morphological deviations from this: for example the initial letterh in words, preservation of the morpheme indeclension of the word (writingb,g,d in places wherep,k,t is pronounced) and in the use ofi andj. Where it is impractical or impossible to typeš andž, they are substituted withsh andzh in some written texts, although this is considered incorrect. Otherwise, theh insh represents avoiceless glottal fricative, as inPasha (pas-ha); this also applies to some foreign names.
Some features of the modern Estonian orthography are:
One consonant between two vowels belongs to the following syllable:kala 'fish' is syllabifiedka-la. Consonant combinations are syllabified before the last consonant:linna 'town [gensg]' is syllabifiedlin-na,tutvus 'acquaintance' is syllabifiedtut-vus. Consonant digraphs and trigraphs in foreign names are regarded as single consonants:Manchester is syllabifiedMan-ches-ter. Two vowels usually form a long vowel or a diphthong, e. g.laulu 'song' is syllabifiedlau-lu. However, a hiatus is formed in morpheme bounds, e. g.avaus 'opening' is syllabifieda-va-us as the word is composed from the rootava- and the suffix-us. Combinations of three vowel letters represent a hiatus of a long vowel or a diphthong and another vowel, e. g.põuane 'dry, droughty, arid (lacking rain)' is syllabifiedpõu-a-ne; but some loanwords have a hiatus of a short vowel followed by a long vowel:oaas 'oasis' is syllabifiedo-aas. Compound words are syllabified as combinations of their parts:vanaema 'grandmother' is syllabified asva-na-e-ma as the word is composed fromvana 'old' andema 'mother'. Etymologically compound loanwords and foreign names may be syllabified as compound or simple words:fotograaf 'photographer' is syllabifiedfo-to-graaf orfo-tog-raaf,Petrograd is syllabifiedPet-ro-grad orPet-rog-rad.
These syllabification rules are used for hyphenating words at the end of line, with the additional rule that a single letter is not left on a line.
Loanwords are normally adapted to Estonian spelling:veeb 'web',džäss 'jazz'. However, foreign words and phrases sometimes may be used in the original spelling, such as Latin phrases, Italian musical terms, exotic words. Such citations are typographically emphasized using italics and declined using an apostrophe:croissant'id 'croissants'.
Foreign proper names from Latin-script languages are written in their original spelling:Margaret Thatcher,Bordeaux. Names from non-Latin-script languages are written using either Estonian orthographic transcription or established romanization systems. Some geographical names (and some names of historical personalities, such as monarchs) have traditional Estonian forms (including some adapted spellings such asViin for GermanWien 'Vienna').
Derivations from foreign proper names with the suffixes-lik, -lane, -lus, -ism, -ist usually conserve the spelling of names (e. g.thatcherism,bordeaux'lane), but a few are adapted by established tradition:marksism,darvinism,luterlus. Derivations without suffixes or with other suffixes are adapted to Estonian spelling:njuuton 'newton' (unit of force),haimoriit (maxillary sinusitis, inflammation of theantrum of Highmore),üterbium 'ytterbium' (chemical element), etc.
Expressions such asCelsiuse kraad 'degree Celsius',Cheddari juust 'Cheddar cheese' conserve the spelling of proper names (adding case endings). However, names of plants and animals are usually written in adapted forms, e.g.koloraado mardikas 'Colorado beetle'.
Apostrophe is used when adding case endings to proper names with unusual grapheme-to-phoneme correspondences (such as ending on a consonant orthographically but on a vowel phonetically or vice versa), e.g.Provence'i (genitive ofProvence).
Capital letters are written at the beginning of the first word in a sentence, proper names, and official names functioned as proper names. May be used in the pronounsSina 'you (singular)' andTeie 'you (plural, also used as formal singular)' to show respect.
Names of months, days of the week, holidays, Chinese zodiac years, and titles of people such asprofessor are not capitalized.
Titles of books, films, etc. are written in quotation marks with only the first word and proper names capitalized.
Compound words are written as one word, but they are often composed of genitive+nominative and hard to distinguish from simple word combinations. A compound word is considered a single word and written together when: 1) it has a separate meaning, e. g.peatükk 'chapter' butpea tükk 'part of a head'; 2) it is different from the genitive+nominative combination, e. g.vesiveski (nominative+nominative) 'watermill'; 3)some combinations may be together or separately, but writing them together is preferred in more complex word phrases:erakonna liige 'member of a party' —iga erakonnaliige 'every member of the party'. Rare and long word combinations are typically written separately.
Thehyphen is used: 1) in compounds where one of the parts is a letter (C-vitamiin 'vitamin C'), an initialism (teksti-TV 'text TV'), a foreign citation (nalja-show 'joke show') or a word part (kuni-sõna 'word containingkuni); 2) in compound adjectives where the first part as a proper name; 3) in compound geographical names such asLõuna-Eesti 'South Estonia'; 4) as asuspended hyphen, e.g.kuld- ja hõbeesemed 'gold and silver things' (also in compound words such asekspordi-impordipank 'export-import bank'); 5) in "nominative+ablative" adverbs, e. g.päev-päevalt 'day after day'; 6) indvandva compounds, e. g.isa-ema 'father and mother'; 7) in compound adjectives from word phrases, e.g.katselis-foneetiline 'related to tentative phonetics'; 8) in compound adjectives with coordinating meaning, e.g.eesti-inglise sõnaraamat 'Estonian-English dictionary'; 9) in double names such asUlla-Liisa. It can be optionally used in unusual compounds such askarusmarja-jahukaste 'gooseberry disease'; in compounds with three or four identical letters in a row (e. g.iga-aastane 'yearly',luu-uure 'bone groove'); in compounds with numbers (see below) or with signs (e.g.+-märk '+ sign'); in the construction 'genitive of a proper name + nominative' after another genitive (e. g.Venemaa Euroopa-osa 'European part of Russia'); in the colloquial construction 'genitive of a proper name + noun' instead of 'noun + proper name', e. g.Kuuse-onu instead ofonu Kuusk 'Uncle Kuusk'; in ad hoc compounds such asaega-küll-meeleolu; in words from two-or-more-component proper names, e. g.françois-villon'lik, buenos-aireslane.
The abbreviation period (full stop) may be used, but it is not mandatory. Commonly used abbreviations are usually written without the abbreviation period:t,tn, ortän fortänav 'street';vt forvaata 'see';jpt forja paljud teised 'and many others'. Using the abbreviation period is recommended when an abbreviation may be misread as another word:joon. forjoonis 'figure, draft' butjoon 'line'. If an abbreviation of a word phrase may be mistaken for a word or for another abbreviation, periods are used after every letter but the last one, and spaces are not used:e.m.a forenne meie ajaarvamist butema 'mother',m.a.j formeie ajaarvamise järgi butmaj formajandus 'economy'.
The hyphen is used in some abbreviations of compound words, e. g.ped-dr forpedagoogikadoktor 'doctor of pedagogy',kpt-ltn forkaptenleitnant 'capitan lieutenant', especially in the constructionabbreviation + complete word, such asrb-paneelid forraudbetoonpaneelid 'reinforced concrete panels'.
Numerals may be written in words (üks 'one',kaks 'two',kolm 'three'...) or in figures (1, 2, 3, ...). In Estonian texts, the comma is used as the decimal separator, and the space is used as thousands separator (in financial documents, the point can be used as thousands separator to avoid inserting an extra digit). The point as a separator is used for dates, daytime, prices, and sports results in meters and centimeters. For prices in euros and cents, writing€ 84.95 as well as84,95 € is accepted. Daytime in hours and minutes (24-hour format) may be written using the point or the colon (without spaces):16.15 or16:15; but seconds are separated by the point:16:15.25. The colon with spaces is used for ratios. e. g.2 : 3.
When written in words, numerals with-teist or-teistkümment (11 to 19),-kümmend (tens) and-sada (hundreds) are written together, e.g.viisteist(kümment) 'fifteen',viiskümmend 'fifty',viissada 'five hundred'. Other compound numerals are written separately:kakskümmend viis 'twenty-five'.
For writing ordinal numbers in figures, theordinal dot is used:16. forkuueteistkümnes 'the sixteenth'. In some cases, ordinals are written as Roman numerals (without the ordinal dot). Roman numerals followed by a dot may be used in numbered lists.
Case forms of cardinal and ordinal numerals may be written in the form "figures+case ending" with or without a hyphen:16s or16-s forkuueteistkümnes 'sixteen [inessive]',16ndas or16-ndas forkuueteistkümnendas 'the sixteenth [inessive]'. For case endings beginning with the letterl, the hyphen is mandatory to avoid confusion with the digit 1:16-le forkuueteistkümnele 'sixteen [allative]'. Case endings after figures are not used when a cardinal or ordinal numeral is in a case concordance with a following noun. Likewise, compound words with numbers written in figures may be written with or without the hyphen:60vatine lamp or60-vatine lamp forkuuekümnevatine lamp '60-watt light'.
The period (full stop) is used at the end of sentences, as an ordinal mark and sometimes as an abbreviation mark and as a number separator (see above).
The comma is used for appositions (but appositions in genitive require the comma only before them), for more than one attribute after a determined word, for enumerations (but the serial comma is not used), between coordinated or subordinated clauses, between direct speech and author's words, before and after parenthetic or vocative phrases, and before and after some other constructions. It is also used between placenames and dates in the nominative case (but not in locative cases); between a surname and a given name, if they are written in this order; before parts of and address; and as a decimal mark.
The colon is used before lists, before direct speech, before explanations, and also in writing daytime and ratios (see above).
The semicolon is used between weakly related parts of sentences, especially containing commas.
The hyphen is used for writing compound words (see above). It is also used for hyphenating words at the end of line, for declining letters and abbreviations, and optionally for declining acronyms/initialisms, numbers, and symbols.
The dash is used when there appears a generalizing word after an enumeration; instead of the comma for accenting clauses and appositions or for relatively long parenthetical constructions; before words indicating surprise; for slight pauses (interchangeably with the ellipsis); in the meaning "from...to" (instead of the wordkuni); for indicating lines or routes (when in attributive function, the hyphen is also accepted); between coordinated attributes if at least one attribute has a hyphen or a space; between remarks of a dialogue written as one line without author's words; as a marker before enumeration items. The dash is not used to indicate omission of a word that would be repeated.
The exclamation and question marks are used at the end of exclamative and interrogative sentences. Occasionally, they may be parenthesized and written after words within sentences to show doubt or surprise. The exclamation mark is also used for addressing people in letters, e.g.Austatud professor Pirk!. Using the comma or the colon in this case is considered inappropriate.
The quotation marks, written as „ ”, are used for direct speech, citations, scare quotes, and names of books, documents, episodes, enterprises, etc. Names of plant sorts may be written in double or in single quotation marks (looking like apostrophes: ’ ’) and are normally italicized. For cited words and phrases, including words in a linguistic context, quotation marks or italics may be used. Quotation marks are not used in the names of institutions, periodicals, awards, wares, and vehicles.
The apostrophe is used for adding case endings and suffixes to foreign names with unusual grapheme-to-phoneme correspondences and to foreign citations in the original spelling (see above). Sometimes the apostrophe is used for adding case endings and suffixes to Estonian names, to make the original form clear:Metsa’le (allative of the surnameMetsa),mutt'lik (the apostrophe is used to conserve the spelling of the surnameMutt, otherwise the double consonant would become a single consonant). Also, the apostrophe is sometimes used in poetry to indicate omission of a sound:õitsel',mull',sull' instead ofõitsele,mulle,sulle are found inLydia Koidula's poems. Single quotation marks (' ') are used for word meanings in a linguistic context.
The parentheses are used for parenthetical words or sentences, and also for optional parts of words in a linguistic context.
The square brackets are used for citer's notes to citations and for showing pronunciation in linguistic and reference works.
The slash is used for division in fractions and unit symbols, for connecting alternatives, to show line breaks when citing poetry in the single-line format, and for non-calendar years. In practice, it occasionally appears in abbreviations made of more than one word (e. g.õ/a forõppiaasta 'school year'), but this usage is considered nonstandard (correct abbreviation:õa). Spaces are used before and after the slash only if it separates text fragments of more than one word.
The ellipsis is used for slight pauses and for unfinished thoughts. It is surrounded by spaces. Also, the ellipsis is used for bowdlerizing obscene words.
Modern Estonian orthography is based on theNewer Orthography created byEduard Ahrens in the second half of the 19th century based on Finnish orthography. TheOlder Orthography it replaced was created in the 17th century byBengt Gottfried Forselius andJohann Hornung based onstandard German orthography. In the old orthography, single consonants following short vowels were written double even if they are short (kala 'fish' was written askalla) and long vowels in an open syllable were written single (looma 'to create' was written asloma). BeforeOtto Wilhelm Masing introduced the letterõ in the early 19th century, its sound had not been distinguished in writing fromö. Earlier writing in Estonian had by and large used an ad hoc orthography based onLatin andMiddle Low German orthography. Some influences of the standard German orthography — for example, writingW/w instead ofV/v persisted well into the 1930s.
InFraktur typesetting (which was common in Estonian publications before the first half of the 20th century), two kinds of the small letters were distinguished: the shorts and thelongſ. The longſ was used at the beginning and in the middle of syllables, and the shorts was used at the end of syllables. For example:kaſs 'cat' —kasſi 'cat [gen. sg., part. sg.]'.
In the second half of the 20th century, some Estonian words and names were quoted in international publications from Soviet sources, and were often in fact spelled as incorrect back-transliterations from Russian Cyrillic. Such examples of Russian transliteration include the use ofя ("ya") forä (e.g. Pyarnu (Пярну) forPärnu),ы ("y") forõ (e.g., Pylva (Пылва) forPõlva) andю ("yu") forü (e.g., Pyussi (Пюсси) forPüssi).