Polish orthography is the system of writing thePolish language. The language is written using the Polish alphabet, which derives from theLatin alphabet, but includes some additional letters withdiacritics.[1]: 6 The orthography is mostly phonetic, or rather phonemic—the written letters (or combinations of them) correspond in a consistent manner to the sounds, or rather thephonemes, of spoken Polish. For detailed information about the system of phonemes, seePolish phonology.
The diacritics used in the Polish alphabet are thekreska (graphically similar to theacute accent) in the lettersć, ń, ó, ś, ź; thekropka (overdot) in the letterż; the stroke in the letterł; and theogonek ("little tail") in the lettersą, ę. There are 32 letters[1]: 4 (or 35 letters, if the foreign lettersq, v, x are included)[2] in the Polish alphabet: 9vowels and 23 or 26consonants.
| Majuscule forms (also called uppercase or capital letters) | ||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A | Ą | B | C | Ć | D | E | Ę | F | G | H | I | J | K | L | Ł | M | N | Ń | O | Ó | P | (Q) | R | S | Ś | T | U | (V) | W | (X) | Y | Z | Ź | Ż |
| Minuscule forms (also called lowercase or small letters) | ||||||||||||||||||||||||||||||||||
| a | ą | b | c | ć | d | e | ę | f | g | h | i | j | k | l | ł | m | n | ń | o | ó | p | (q) | r | s | ś | t | u | (v) | w | (x) | y | z | ź | ż |
| Name of letters | ||||||||||||||||||||||||||||||||||
| a | ą | be | ce | cie | de | e | ę | ef | gie | ha | i | jot | ka | el | eł | em | en | eń | o | ó zamknięte | pe | ku | er | es | eś | te | u | fau | wu | iks | igrek | zet | ziet | żet |

The lettersq (namedku),v (namedfau or rarelywe[3]), andx (namediks) are used in some foreign words and commercial names. Inloanwordsq andv are often replaced bykw andw, respectively, andx byks orgz (as inkwarc "quartz",weranda "veranda",ekstra "extra",egzosfera, "exosphere").
When giving the spelling of words, certain letters may be said in more emphatic ways to distinguish them from other identically pronounced characters. For example, H may be referred to assamo h ("h alone") to distinguish it from CH (ce ha). The letter Ż may be called"żet(or zet) z kropką" ("Ż with a dot") to distinguish it from RZ (er zet). The letter U may be calledu otwarte ("open u", a reference to its graphical form) oru zwykłe ("regular u"), to distinguish it from Ó, which is sometimes calledó zamknięte ("closed ó"),ó kreskowane oró z kreską ("ó with a stroke accent"), alternativelyo kreskowane oro z kreską ("o with a stroke accent"). The letteró is a relic from hundreds of years ago when there was a length distinction in Polish similar to that inCzech, withá andé also being common at the time. Subsequently, the length distinction disappeared andá andé were abolished, butó came to be pronounced the same asu.
Note that Polish letters withdiacritics are treated as fully independent letters in alphabetical ordering (unlike in languages such asFrench,Spanish, andGerman). For example,być comes afterbycie. The diacritic letters also have their own sections in dictionaries (words beginning withć are not usually listed underc). However, there are no regular words that begin withą orń.
Polish additionally uses thedigraphsch,cz,dz,dź,dż,rz, andsz. Combinations of certain consonants with the letteri before a vowel can be considered digraphs:ci as a positional variant ofć,si as a positional variant ofś,zi as a positional variant ofź, andni as a positional variant ofń (but see a special remark onnibelow); and there is also onetrigraphdzi as a positional variant ofdź. These are not given any special treatment in alphabetical ordering. For example,ch is treated simply asc followed byh, and not as a single letter as inCzech orSlovak (e.g.Chojnice only has its first letter capitalised, and is sorted afterCanki and beforeCieszyn).
| Grapheme | Usual value | Other values |
|---|---|---|
| a | /a/ | |
| ą | /ɔw̃/ | [ɔn],[ɔŋ],[ɔm]; becomes/ɔ/ before/w/ (seebelow) |
| e | /ɛ/ | |
| ę | /ɛw̃/ | [ɛn],[ɛŋ],[ɛm]; becomes/ɛ/ word-finally and before/l/ and/w/ (seebelow) |
| i | /i/ | [j] before a vowel; marks palatalization of the preceding consonant before a vowel (seebelow) |
| o | /ɔ/ | |
| ó | /u/ | |
| u | in certain cases, represents[w] after vowels | |
| y | /ɨ/ | usually transcribed as/ɨ/ butpronounced closer to[ɘ] or[ɪ] |
| Grapheme | Usual value | Voiced or devoiced |
|---|---|---|
| b | /b/ | [p] if devoiced |
| c1 | /t͡s/ | [d͡z] if voiced |
| ć1 | /t͡ɕ/ | [d͡ʑ] if voiced |
| cz | /t͡ʂ/ | [d͡ʐ] if voiced |
| d | /d/ | [t] if devoiced |
| dz1 | /d͡z/ | [t͡s] if devoiced |
| dź1 | /d͡ʑ/ | [t͡ɕ] if devoiced |
| dż | /d͡ʐ/ | [t͡ʂ] if devoiced |
| f | /f/ | [v] if voiced |
| g | /ɡ/ | [k] if devoiced |
| h | /x/ | [ɣ] if voiced2 |
| ch | ||
| j | /j/ | |
| k | /k/ | [ɡ] if voiced |
| l | /l/ | |
| ł | /w/ | |
| m | /m/ | |
| n1 | /n/ | |
| ń1 | /ɲ/ | |
| p | /p/ | [b] if voiced |
| r | /r/ | |
| s1 | /s/ | [z] if voiced |
| ś1 | /ɕ/ | [ʑ] if voiced |
| sz | /ʂ/ | [ʐ] if voiced |
| t | /t/ | [d] if voiced |
| w | /v/ | [f] if devoiced |
| z1 | /z/ | [s] if devoiced |
| ź1 | /ʑ/ | [ɕ] if devoiced |
| ż | /ʐ/ | [ʂ] if devoiced |
| rz3 |
^1 Seebelow for rules regarding spelling of alveolo-palatal consonants.
^2 H may be glottal[ɦ] in a small number of dialects.
^3 Rarely,⟨rz⟩ is not a digraph and represents two separate sounds:
Voiced consonant letters frequently come to represent voiceless sounds (as shown in the above tables). This is due to theneutralization that occurs at the end of words and in certainconsonant clusters; for example, the⟨b⟩ inklub ("club") is pronounced like a⟨p⟩, and the⟨rz⟩ inprze- sounds like⟨sz⟩. Less frequently, voiceless consonant letters can represent voiced sounds; for example, the⟨k⟩ intakże ("also") is pronounced like a⟨g⟩. The conditions for this neutralization are described underVoicing and devoicing in the article on Polish phonology.
The spelling rule for thealveolo-palatal sounds/ɕ/,/ʑ/,/t͡ɕ/,/d͡ʑ/ and/ɲ/ is as follows: before the vowel⟨i⟩ the plain letters⟨s z c dz n⟩ are used; before other vowels the combinations⟨si zi ci dzi ni⟩ are used; when not followed by a vowel the diacritic forms⟨ś ź ć dź ń⟩ are used. For example, the⟨s⟩ insiwy ("grey-haired"), the⟨si⟩ insiarka ("sulphur") and the⟨ś⟩ inświęty ("holy") all represent the sound/ɕ/.
| Sound | Word-finally or before a consonant | Before a vowel other than⟨i⟩ | Before⟨i⟩ |
|---|---|---|---|
| /t͡ɕ/ | ć | ci | c |
| /d͡ʑ/ | dź | dzi | dz |
| /ɕ/ | ś | si | s |
| /ʑ/ | ź | zi | z |
| /ɲ/ | ń | ni | n |
Special attention should be paid to⟨n⟩ before⟨i⟩ plus a vowel. In words of foreign origin the⟨i⟩ causes the palatalization of the preceding consonant⟨n⟩ to/ɲ/, and it is pronounced as/j/. This situation occurs when the corresponding genitive form ends in-nii, pronounced as/ɲji/, not with-ni, pronounced as/ɲi/ (which is a situation typical to the words of Polish origin). For examples, see the table in the next section.
According to one system, similar principles apply to thepalatalized consonants/kʲ/,/ɡʲ/ and/xʲ/, except that these can only occur before vowels. The spellings are thus⟨k g (c)h⟩ before⟨i⟩, and⟨ki gi (c)hi⟩ otherwise. For example, the⟨k⟩ inkim ("whom", instr.) and the⟨ki⟩ inkiedy both represent/kʲ/. In the system without the palatalized velars, they are analyzed as /k/, /ɡ/ and /x/ before /i/ and /kj/, /ɡj/ and /xj/ before other vowels.
Except in the cases mentioned in the previous paragraph, the letter⟨i⟩ if followed by another vowel in the same word usually represents/j/, but it also has the palatalizing effect on the previous consonant. For example,pies ("dog") is pronounced[pʲjɛs] (/pjɛs/). Some words with⟨n⟩ before⟨i⟩ plus a vowel also follow this pattern (see below).In facti is the usual spelling of/j/ between a preceding consonant and a following vowel. The letter⟨j⟩ normally appears in this position only after⟨c⟩,⟨s⟩ and⟨z⟩ if the palatalization effect described above has to be avoided (as inpresja "pressure",Azja "Asia",lekcja "lesson", and the common suffixes-cja "-tion",-zja "-sion":stacja "station",wizja "vision"). The letter⟨j⟩ after consonants is also used in concatenation of two words if the second word in the pair starts with⟨j⟩, e.g.wjazd "entrance" originates fromw +jazd(a). The pronunciation of the sequencewja (inwjazd) is the same as the pronunciation ofwia (inwiadro "bucket").
The ending-ii which appears in the inflected forms of some nouns of foreign origin, which have-ia in the nominative case (always after⟨g⟩,⟨k⟩,⟨l⟩, and⟨r⟩; sometimes after⟨m⟩,⟨n⟩, and other consonants), is pronounced as[ji], with the palatalization of the preceding consonant. For example,dalii (genitive ofdalia "dalia"),Bułgarii (genitive ofBułgaria "Bulgaria"),chemii (genitive ofchemia "chemistry"),religii (genitive ofreligia "religion"),amfibii (genitive ofamfibia "amphibia"). The common pronunciation is[i]. This is why children commonly misspell and write-i in the inflected forms asarmii,Danii or hypercorrectly writeziemii instead ofziemi (words of Polish origin do not have the ending-ii but simple-i, e.g.ziemi, genitive ofziemia).
In some rare cases, however, when the consonant is preceded by another consonant,-ii may be pronounced as[i], but the preceding consonant is still palatalized, for example,Anglii (genitive ofAnglia "England") is pronounced[anɡlʲi]. (The spellingAngli, very frequently met with on the Internet, is simply an error in orthography, caused by this pronunciation.)
A special situation applies to⟨n⟩: it has the full palatalization to[ɲ] before-ii which is pronounced as[ji] – and such a situation occurs only when the corresponding nominative form in-nia is pronounced as[ɲja], not as[ɲa].
For example (pay attention to the upper- and lower-case letters):
| Case | Word | Pronunciation | Meaning | Word | Pronunciation | Meaning |
|---|---|---|---|---|---|---|
| Nominative | dania | /daɲa/ | dishes (plural) | Dania | /daɲja/ | Denmark |
| Genitive | (dań) | (/daɲ/) | (of dishes) | Danii | /daɲji/ | of Denmark |
| Nominative | Mania | /maɲa/ | Mary (diminutive of "Maria") | mania | /maɲja/ | mania |
| Genitive | (Mani) | (/maɲi/) | (of Mary) | manii | /maɲji/ | of mania |
The ending-ji, is always pronounced as/ji/. It appears only afterc,s andz. Pronunciation of it as a simple/i/ is considered a pronunciation error. For example,presji (genitive ofpresja "pressure") is/prɛsji/;poezji (genitive ofpoezja "poetry") is/pɔɛzji/;racji (genitive ofracja "reason") is/rat͡sji/.
The letters⟨ą⟩ and⟨ę⟩, when followed by plosives and affricates, represent an oral vowel followed by a nasal consonant, rather than a nasal vowel. For example,⟨ą⟩ indąb ("oak") is pronounced/ɔm/, and⟨ę⟩ intęcza ("rainbow") is pronounced/ɛn/ (the nasalassimilates with the following consonant). When followed by⟨l⟩ or⟨ł⟩, and in the case of⟨ę⟩, also at the end of words by most speakers (in a situation where the speaker pronounces the vowel nasally, it is nasalized only lightly),[4] these letters are pronounced as just/ɔ/ or/ɛ/.
Apart from the cases in the sections above, there are three sounds in Polish that can be spelt in two different ways, depending on the word. Those result from historical sound changes. The correct spelling can often be deduced from the spelling of other morphological forms of the word or cognates in Polish or in other Slavic languages.
The letter⟨u⟩ represents/w/ in the digraphs⟨au⟩ and⟨eu⟩ in loanwords, for exampleautor, Europa; but not in native words, likenauka, pronounced[naˈu.ka].
There are certain clusters where a written consonant would not normally be pronounced. For example, the⟨ł⟩ in the wordsmógł ("could") andjabłko ("apple") is omitted in ordinary speech.
Names are generally capitalized in Polish as in English. Polish does not capitalize the months and days of the week, nor adjectives and other forms derived from proper nouns (for example,angielski "English").
Titles such aspan ("Mr"),pani ("Mrs/Ms"),lekarz ("doctor"), etc. and their abbreviations are not capitalized, except in written polite address. Second-person pronouns are traditionally capitalized in formal writing (e.g. letters or official emails); so may be other words used to refer to someone directly in a formal setting, likeCzytelnik ("reader", in newspapers or books). Third-person pronouns are capitalized to show reverence, most often in a sacred context.
The following are capitalized:
The following are not capitalized:
The following are capitalized:
Polish punctuation is similar to that of English. However, there are more rigid rules concerning use ofcommas—subordinate clauses are almost always marked off with a comma, while it is normally considered incorrect to use a comma before acoordinating conjunction with the meaning "and" (i,a ororaz).
Abbreviations (but notacronyms orinitialisms) are followed by a period when they end with a letter other than the one which ends the full word. For example,dr has no period when it stands fordoktor, but takes one when it stands for an inflected form such asdoktora andprof. has period because it comes fromprofesor (professor).
Apostrophes are used to mark the elision of the final sound of foreign words not pronounced before Polish inflectional endings, as inHarry'ego ([xaˈrɛɡɔ], genitive ofHarry[ˈxarɨ] – the final[ɨ] is elided in the genitive). However, it is often erroneously used to separate a loanword stem from any inflectional ending, for example,*John'a, which should beJohna (genitive ofJohn; no sound is elided).
Quotation marks are used in different ways: either „ordinary Polish quotes” or «French quotes» (without space) for first level, and ‚single Polish quotes’ or «French quotes» for second level, which gives three styles of nested quotes:
Some older prints have used „such Polish quotes“.
Poles adopted theLatin alphabet in the 12th century. However, that alphabet was ill-equipped to represent certain Polish sounds, such as thepalatal consonants andnasal vowels. Consequently, Polish spelling in theMiddle Ages was highly inconsistent, as different writers used different systems to represent these sounds, For example, in early documents the letterc could signify the sounds now writtenc, cz, k, while the letterz was used for the sounds now writtenz, ż, ś, ź. Writers soon began to experiment withdigraphs (combinations of letters), new letters (φ and ſ, no longer used), and eventuallydiacritics.
The Polish alphabet was one of two major forms of Latin-based orthography developed forSlavic languages, the other beingCzech orthography, characterized bycarons (hačeks), as in the letterč. The other major Slavic languages which are now written in Latin-based alphabets (Slovak,Slovene, andSerbo-Croatian) use systems similar to the Czech.Sorbian spelling is also closer to Czech, though it does include more Polish elements than the aforementioned languages. Polish-based orthographies are used forKashubian and usuallySilesian, both spoken in Poland.
The letterƶ is a historical allograph for ż.[citation needed]
There are several different systems forencoding the Polish alphabet for computers. All letters of the Polish alphabet are included inUnicode, and thus Unicode-based encodings such asUTF-8 andUTF-16 can be used. The Polish alphabet is completely included in theBasic Multilingual Plane of Unicode.ISO 8859-2 (Latin-2),ISO 8859-13 (Latin-7),ISO 8859-16 (Latin-10) andWindows-1250 are popular 8-bit encodings that support the Polish alphabet.
The Polish letters which are not present in theEnglish alphabet use the followingHTML character entities[6] andUnicode codepoints:[7][8]
| Upper case | Ą | Ć | Ę | Ł | Ń | Ó | Ś | Ź | Ż | Ƶ |
|---|---|---|---|---|---|---|---|---|---|---|
| HTML entity | Ą Ą | Ć Ć | Ę Ę | Ł Ł | Ń Ń | Ó Ó | Ś Ś | Ź Ź | Ż Ż | — |
| Unicode | U+0104 | U+0106 | U+0118 | U+0141 | U+0143 | U+00D3 | U+015A | U+0179 | U+017B | U+01B5 |
| Result | Ą | Ć | Ę | Ł | Ń | Ó | Ś | Ź | Ż | — |
| Lower case | ą | ć | ę | ł | ń | ó | ś | ź | ż | ƶ |
|---|---|---|---|---|---|---|---|---|---|---|
| HTML entity | ą ą | ć ć | ę ę | ł ł | ń ń | ó ó | ś ś | ź ź | ż ż | — |
| Unicode | U+0105 | U+0107 | U+0119 | U+0142 | U+0144 | U+00F3 | U+015B | U+017A | U+017C | U+01B6 |
| Result | ą | ć | ę | ł | ń | ó | ś | ź | ż | — |
For other encodings, see the following table. Numbers in the table arehexadecimal.
| Character Set | Ą | Ć | Ę | Ł | Ń | Ó | Ś | Ź | Ż | ą | ć | ę | ł | ń | ó | ś | ź | ż |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ISO 8859-2 | A1 | C6 | CA | A3 | D1 | D3 | A6 | AC | AF | B1 | E6 | EA | B3 | F1 | F3 | B6 | BC | BF |
| Windows-1250 | A5 | C6 | CA | A3 | D1 | D3 | 8C | 8F | AF | B9 | E6 | EA | B3 | F1 | F3 | 9C | 9F | BF |
| IBM 852 | A4 | 8F | A8 | 9D | E3 | E0 | 97 | 8D | BD | A5 | 86 | A9 | 88 | E4 | A2 | 98 | AB | BE |
| Mazovia | 8F | 95 | 90 | 9C | A5 | A3 | 98 | A0 | A1 | 86 | 8D | 91 | 92 | A4 | A2 | 9E | A6 | A7 |
| Mac | 84 | 8C | A2 | FC | C1 | EE | E5 | 8F | FB | 88 | 8D | AB | B8 | C4 | 97 | E6 | 90 | FD |
| ISO 8859-13 andWindows-1257 | C0 | C3 | C6 | D9 | D1 | D3 | DA | CA | DD | E0 | E3 | E6 | F9 | F1 | F3 | FA | EA | FD |
| ISO 8859-16 | A1 | C5 | DD | A3 | D1 | D3 | D7 | AC | AF | A2 | E5 | FD | B3 | F1 | F6 | F7 | AE | BF |
| PN-T-42109-02:1984 "ZU0" | — | — | — | 5C | — | — | — | — | — | 60 | 7E | 40 | 7C | 5D | 7B | 5E | 5B | 7D |
| PN-T-42109-03:1986 "ZU2" | 3B | 24 | 23 | 5C | 3C | 27 | 3E | 2A | 26 | 60 | 7E | 40 | 7C | 5D | 7B | 5E | 5B | 7D |
| PN-I-10050:2002[a] | 5A | 43 08 27 | 5C | 5D | 4E 08 27 | 4F 08 27 | 53 08 27 | 5A 08 27 | 5E | 7B | 63 08 27 | 7C | 7D | 6E 08 27 | 6F 08 27 | 73 08 27 | 7A 08 27 | 7E |
| IBM 775 | B5 | 80 | B7 | AD | E0 | E3 | 97 | 8D | A3 | D0 | 87 | D3 | 88 | E7 | A2 | 98 | A5 | A4 |
| CSK | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 88 | 87 | A0 | A1 | A2 | A3 | A4 | A5 | A6 | A8 | A7 |
| Cyfromat | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 88 | 87 | 90 | 91 | 92 | 93 | 94 | 95 | 96 | 98 | 97 |
| DHN | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 88 | 87 | 89 | 8A | 8B | 8C | 8D | 8E | 8F | 91 | 90 |
| IINTE-ISIS | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 | 88 | 90 | 91 | 92 | 93 | 94 | 95 | 96 | 97 | 98 |
| IEA-Swierk | 8F | 80 | 90 | 9C | A5 | 99 | EB | 9D | 92 | A0 | 9B | 82 | 9F | A4 | A2 | 87 | A8 | 91 |
| Logic | 80 | 81 | 82 | 83 | 84 | 85 | 86 | 87 | 88 | 89 | 8A | 8B | 8C | 8D | 8E | 8F | 90 | 91 |
| Microvex | 8F | 80 | 90 | 9C | A5 | 93 | 98 | 9D | 92 | A0 | 9B | 82 | 9F | A4 | A2 | 87 | A8 | 91 |
| Ventura | 97 | 99 | A5 | A6 | 92 | 8F | 8E | 90 | 80 | 96 | 94 | A4 | A7 | 91 | A2 | 84 | 82 | 87 |
| ELWRO-Junior | C1 | C3 | C5 | CC | CE | CF | D3 | DA | D9 | E1 | E3 | E5 | EC | EE | EF | F3 | FA | F9 |
| AmigaPL | C2 | CA | CB | CE | CF | D3 | D4 | DA | DB | E2 | EA | EB | EE | EF | F3 | F4 | FA | FB |
| TeXPL | 81 | 82 | 86 | 8A | 8B | D3 | 91 | 99 | 9B | A1 | A2 | A6 | AA | AB | F3 | B1 | B9 | BB |
| Atari Club (Atari ST) | C1 | C2 | C3 | C4 | C5 | C6 | C7 | C8 | C9 | D1 | D2 | D3 | D4 | D5 | D6 | D7 | D8 | D9 |
| CorelDraw! | C5 | F2 | C9 | A3 | D1 | D3 | FF | E1 | ED | E5 | EC | E6 | C6 | F1 | F3 | A5 | AA | BA |
| ATM | C4 | C7 | CB | D0 | D1 | D3 | D6 | DA | DC | E4 | E7 | EB | F0 | F1 | F3 | F6 | FA | FC |
A common test sentence containing all the Polish diacritic letters is the nonsensical "Zażółć gęślą jaźń".