| Written Chinese | |||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chinese | 中文 | ||||||||||||||||||||||||||||||||||||||||||||
| Hanyu Pinyin | Zhōngwén | ||||||||||||||||||||||||||||||||||||||||||||
| Literal meaning | Chinese writing | ||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||
| Han writing[a] | |||||||||||||||||||||||||||||||||||||||||||||
| Simplified Chinese | 汉文 | ||||||||||||||||||||||||||||||||||||||||||||
| Traditional Chinese | 漢文 | ||||||||||||||||||||||||||||||||||||||||||||
| Hanyu Pinyin | Hànwén | ||||||||||||||||||||||||||||||||||||||||||||
| Literal meaning | Han writing | ||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||
| Chinese characters |
|---|
Collation and standards
|
Homographs and readings |
Written Chinese is awriting system thattranscribes thevarieties ofChinese language usinglogograms — known ascharacters — and other symbols such aspunctuations. Chinese characters do not directly representpronunciation, unlike letters in analphabet or syllabograms in asyllabary. Rather, the writing system ismorphosyllabic: characters are one spokensyllable in length, but generally correspond tomorphemes in the language, which may either be independent words, or part of a polysyllabic word. Most characters are constructed fromsmaller components known asradicals orpianpang that may reflect the character's meaning or pronunciation.[1]Literacy requires the memorization of thousands of characters; college-educated Chinese speakers know approximately 4,000 characters.[2][3] This has led in part to the modern adoption of complementaryphonetic transcription systems such asPinyin andBopomofo totransliterate the pronunciation of each character.[4][5]
Chinese writing is first attested during the lateShang dynasty (c. 1250 – c. 1050 BCE) in the form oforacle bone script,[6][7][8] but the process of creatingcharacter-like symbols is thought to have begun centuries earlier during theLate Neolithic and earlyBronze Age (c. 2500–2000 BCE).[9] During theZhou dynasty (1046–256 BC), Chinese characters evolved into the more maturebronze script andseal script varieties, the latter of which were standardized under the short-livedQin dynasty (221–206 BCE)[10] and further consolidated into the more convenientclerical script (a process known aslibian) during the subsequentHan dynasty (202 BC–220 AD). Over the following millennia, these characters have evolved into well-developed styles ofChinese calligraphy, from the formalregular script to the more casualrunning script andcursive script,[11] as well as standardizedprinting-orientedtypefaces such asSong,Fangsong andgothic. As the differentSinitic varieties diverged, a situation ofdiglossia developed, with speakers of otherwise mutually unintelligible varieties able to communicate through writing usingLiterary Chinese.[12] In the early 20th century, Literary Chinese was replaced in large part withwritten vernacular Chinese, largely corresponding to the grammar of modernStandard Chinese, astandard form based on theBeijing dialect ofMandarin. Although most other Chinese varieties are not written in their own vernacular styles, there are local traditions ofwritten Cantonese,written Shanghainese andwritten Hokkien, among others.
Written Chinese is not based on an alphabet or syllabary.[13] Most characters can be analyzed as compounds of smaller components, which may be assembled according to several different principles. Characters and components may reflect aspects of meaning or pronunciation. The best known exposition of Chinese character composition is theShuowen Jiezi, compiled byXu Shenc. 100 CE. Xu did not have access to the earliest forms of Chinese characters, and his analysis is not considered to fully capture the nature of the writing system.[14] Nevertheless, no later work has supplanted theShuowen Jiezi in terms of breadth, and it is still relevant to etymological research today.[15]
According to theShuowen Jiezi, Chinese characters are developed on six basic principles.[16] (These principles, though popularized by theShuowen Jiezi, were developed earlier; the oldest known mention of them is in theRites of Zhou, a text fromc. 150 BCE.[17]) The first two principles produce simple characters, known as文 (wén):[16]
The remaining four principles produce complex characters historically called字 (zì), though this term is now generally used to refer to all characters, whether simple or complex. Of these four, two construct characters from simpler parts:[16]
The last two principles do not produce new written forms; they instead transfer new meanings to existing forms:[16]
In contrast to the popular conception of written Chinese asideographic, the vast majority of characters—about 95% of those in theShuowen Jiezi—either reflect elements of pronunciation, or are logical aggregates.[1] In fact, some phonetic complexes were originally simple pictographs that were later augmented by the addition of a semantic root. An example is炷;zhù; 'lampwick', now archaic, which was originally a pictograph of a lamp stand主, a character that is now pronouncedzhǔ and means 'host', or the character火 (huǒ; 'fire') was added to indicate that the meaning is fire related.[18]
Chinese characters are written to fit into a square, even when composed of two simpler forms written side-by-side or top-to-bottom. In such cases, each form is compressed to fit the entire character into a square.[19]
Character components can be further subdivided into individual written strokes. The strokes of Chinese characters fall into eight main categories: "horizontal"⟨一⟩, "vertical"⟨丨⟩, "left-falling"⟨丿⟩, "right-falling"⟨丶⟩, "rising", "dot"⟨、⟩, "hook"⟨亅⟩, and "turning"⟨乛⟩,⟨乚⟩,⟨乙⟩.[20]
There are eight basic rules of stroke order in writing a Chinese character, which apply only generally and are sometimes violated:[21]

As characters are essentially rectilinear and are not joined with one another, written Chinese does not require a set orientation. Chinese texts were traditionally written in columns from top to bottom, which were laid out from right to left. Prior to the 20th century, Literary Chinese used little to no punctuation, with the breaks between sentences and phrases determined largely by context and the rhythms implied by patterns of syllables.[22]
In the 20th century, the layout used in Western scripts—where text is written in rows from left to right, which are laid out from top to bottom—became predominant in mainland China, where it was mandated by the Chinese government in 1955. Vertical layouts are still used for aesthetic effect, or when space limitations require it, such as on signage or book spines.[23] The government ofTaiwan followed suit in 2004 for official documents, but vertical layouts have persisted in some books and newspapers.[24]
Less frequently, Chinese is written in rows from right to left, usually on signage or banners, though a left to right orientation remains more common.[25]
The use of punctuation has also become more common. In general, punctuation occupies the width of a full character, such that text remains visually well-aligned in a grid. Punctuation used in simplified Chinese shows clear influence from that used in Western scripts, though some marks are particular to Asian languages. For example, there are double and single quotation marks (『 』 and 「 」), and a hollow full stop (。), which is used to separate sentences in an identical manner to a Western full stop. A special mark called anenumeration comma (、) is used to separate items in a list, as opposed to the clauses in a sentence.

Written Chinese is one of the oldest continuously used writing systems.[26] The earliest examples universally accepted as Chinese writing are theoracle bone inscriptions made during the reign of theShang kingWu Ding (c. 1250 – c. 1192 BCE). These inscriptions were made primarily on ox scapulae and turtle shells in order to record the results of divinations conducted by the Shang royal family. Characters posing a question were first carved into the bones. The question's answer was then divined by heating the bones over a fire and interpreting the resulting cracks that formed. The interpretation was then carved into the sameoracle bone.[27]
In 2003, 11 isolated symbols carved on tortoise shells were found at theJiahu archaeological site inHenan—with some bearing a striking resemblance to certain modern characters, such as目 (mù; 'eye'). The Jiahu site dates fromc. 6600 BCE, predating the earliest attested Chinese writing by more than 5,000 years. Garman Harbottle, who had headed a team of archaeologists at theUniversity of Science and Technology of China in Anhui—has suggested that these symbols were precursors to Chinese writing. However, the palaeographerDavid Keightley argues instead that the time gap is too great to establish any connection.[28]From theLate Shang period (c. 1250 – c. 1050 BCE), Chinese writing evolved into the form found incast inscriptions on ritual bronzes made during theWestern Zhou dynasty (c. 1046 – 771 BCE) and theSpring and Autumn period (771–476 BCE), a form of writing called bronze script (金文;jīnwén). Bronze script characters are less angular than their oracle bone script counterparts. The script became increasingly regularized during theWarring States period (475–221 BCE), settling into what is called 'script of the six states' (六国文字;liùguó wénzì), that Xu Shen used as source material in theShuowen Jiezi. These characters were later embellished and stylized to yield theseal script, which represents the oldest form of Chinese characters still in modern use. They are used principally forsignature seals, or chops, which are often used in place of a signature for Chinese documents and artwork.Li Si promulgated the seal script as the standard throughout China, which had been recently united under the imperialQin dynasty (221–206 BCE).[10]
The initial adaptation of seal intoclerical script can be attributed to scribes in thestate of Qin working prior to the wars of unification. Clerical script forms generally have a "flat" appearance, being wider than their seal script equivalents.[29] In thesemi-cursive script that evolved from clerical script, character elements begin to run into each other, though the characters themselves generally remain discrete. This is contrasted with fullycursive script, where characters are often rendered unrecognizable by their canonical forms.[30]Regular script is the most widely recognized script, and was considerably influenced by semi-cursive. In regular script, each stroke of each character is clearly drawn out from the others.[31]
Regular script is considered the archetypal Chinese writing and forms the basis for most printed forms. In addition, regular scriptimposes a stroke order, which must be followed in order for the characters to be written correctly.[32] Strictly speaking, this stroke order applies to the clerical, running, and grass scripts as well, but especially in the running and grass scripts, this order is occasionally deviated from. Thus, for instance, the character木 (mù; 'wood') must be written starting with the horizontal stroke, drawn from left to right; next, the vertical stroke, from top to bottom; next, the left diagonal stroke, from top to bottom; and lastly the right diagonal stroke, from top to bottom.[33]
Beginning in the mid-20th century, Chinese has primarily been written using eithersimplified ortraditional character forms. Simplified characters, which merge some character forms and reduce the average stroke count per character, were developed by the Chinese government with the stated goal of increasing literacy among the population. During this time, literacy rates did increase rapidly, but some observers instead attribute this to other education reforms and a general increase in the standard of living. Little systematic research has been conducted to support the conclusion that the use of simplified characters has affected literacy rates; studies conducted in China have instead focused on arbitrary statistics, such as quantifying the number of strokes saved on average in a given text sample.[34] Simplified characters are standard in mainland China,Singapore andMalaysia, while traditional characters are standard inHong Kong,Macau,Taiwan and someoverseas Chinese communities.[35][page needed]
Simplified forms have also been characterized as being inconsistent. For instance, the traditional讓;ràng; 'allow' is simplified to让, in which the phonetic on the right side is reduced from 17 strokes to 3, and the⾔ 'SPEECH' radical on the left also being simplified. However, the same phonetic component is not reduced in simplified characters such as壤;rǎng; 'soil' and齉;nàng; 'snuffle'—these characters are relatively uncommon, and would therefore represent a negligible stroke reduction.[36] Other simplified forms derive from long-standing calligraphic abbreviations, as with万;wàn; 'ten thousand', which has the traditional form of萬.[37]

Chinese characters have always been used to represent individual spoken syllables. While writing was being invented in the Yellow River valley, words in spoken Chinese were largely monosyllabic, and each written character corresponded to a monosyllabic word.[39] Spoken Chinese varieties have since acquired much more polysyllabic vocabulary,[40] usually compound words composed of morphemes corresponding to older monosyllabic words.[41]
For over two thousand years, the predominant form of written Chinese wasLiterary Chinese, which had vocabulary and syntax rooted in the language of theChinese classics, as spoken around the time ofConfucius (c. 500 BCE). Over time, Literary Chinese acquired some elements of grammar and vocabulary from various varieties of vernacular Chinese that had since diverged. By the 20th century, Literary Chinese was distinctly different from any spoken vernacular, and had to be learned separately.[42][43] Once learned, it was a common medium for communication between people speaking different dialects, many of which were mutually unintelligible by the end of the first millennium CE.[44][12]

Varieties of Chinese vary in pronunciation, and to a lesser extent in vocabulary and grammar.[45] Modern written Chinese, which replaced Classical Chinese as the written standard as an indirect result of the 1919May Fourth Movement, is not technically bound to any single variety; however, it most nearly represents the vocabulary and syntax of Mandarin, by far the most widespread Chinese dialectal family in terms of both geographical area and number of speakers.[46] This form is known aswritten vernacular Chinese.[47] While some written vernacular Chinese expressions are often ungrammatical or unidiomatic outside of Mandarin, its use permits some communication between speakers of different dialects. This function may be considered analogous to that oflinguae francae, such asLatin. For literate speakers, it serves as a common medium; however, the forms of individual characters generally provide little insight to their meaning if not already known.[48] Ghil'ad Zuckermann's exploration ofphono-semantic matching inStandard Chinese concludes that the Chinese writing system is multifunctional, conveying both semantic and phonetic content.[49]
The variation in vocabulary among varieties has also led to informal use of "dialectal characters", which may include characters previously used in Literary Chinese that are considered archaic in written Standard Chinese.[50] Cantonese is unique among non-Mandarin regional languages in having a written colloquial standard, used in Hong Kong and overseas, with a large number of unofficial characters for words particular to this language.[51]Written Cantonese has become quite popular on the Internet, while Standard Chinese is still normally used in formal written communications.[52] To a lesser degree,Hokkien is used similarly in Taiwan and elsewhere, though it lacks the level of standardization seen in Cantonese. However, Taiwan's Ministry of Education has promulgated a standard character set for Hokkien, which is taught in schools and encouraged for use by the general population.[53]
Over the history of written Chinese, a variety of media have been used for writing. They include:
Since at least the Han dynasty, such media have been used to createhanging scrolls andhandscrolls.
Because the majority of modern Chinese words contain more than one character, there are at least two measuring sticks for Chinese literacy: the number of characters known, and the number of words known.John DeFrancis, in the introduction to hisAdvanced Chinese Reader, estimates that a typical Chinese college graduate recognizes 4,000 to 5,000 characters, and 40,000 to 60,000 words.[2]Jerry Norman, inChinese, places the number of characters somewhat lower, at 3,000 to 4,000.[3] These counts are complicated by the tangled development of Chinese characters. In many cases, a single character came to have multiplevariants. This development was restrained to an extent by the standardization of the seal script during the Qin dynasty, but soon started again. Although theShuowen Jiezi lists 10,516 characters—9,353 of them unique (some of which may already have been out of use by the time it was compiled) plus 1,163 graphic variants—theJiyun of the NorthernSong dynasty, compiled less than a thousand years later in 1039, contains 53,525 characters, most of them graphic variants.[54]
Written Chinese is not based on an alphabet or syllabary, so Chinese dictionaries, as well as dictionaries that define Chinese characters in other languages, cannot easily be alphabetized or otherwise lexically ordered, as English dictionaries are. The need to arrange Chinese characters in order to permit efficient lookup has given rise to a considerable variety of ways to organize and index the characters.[55]
A traditional mechanism is the method of radicals, which uses a set of character roots. These roots, or radicals, generally but imperfectly align with the parts used to compose characters by means of logical aggregation and phonetic complex. Acanonical set of 214 radicals was developed during the rule of theKangxi Emperor (around the year 1700); these are sometimes called the Kangxi radicals. The radicals are ordered first by stroke count (that is, the number of strokes required to write the radical); within a given stroke count, the radicals also have a prescribed order.[56]

Every Chinese character falls (sometimes arbitrarily or incorrectly) under the heading of exactly one of these 214 radicals.[55] In many cases, the radicals are themselves characters, which naturally come first under their own heading. All other characters under a given radical are ordered by the stroke count of the character. Usually, however, there are still many characters with a given stroke count under a given radical. At this point, characters are not given in any recognizable order; the user must locate the character by going through all the characters with that stroke count, typically listed for convenience at the top of the page on which they occur.[57]
Because the method of radicals is applied only to the written character, one need not know how to pronounce a character before looking it up; the entry, once located, usually gives the pronunciation. However, it is not always easy to identify which of the various roots of a character is the proper radical. Accordingly, dictionaries often include a list of hard to locate characters, indexed by total stroke count, near the beginning of the dictionary. Some dictionaries include almost one-seventh of all characters in this list.[55] Alternatively, some dictionaries list "difficult" characters under more than one radical, with all but one of those entries redirecting the reader to the "canonical" location of the character according to Kangxi.
Other methods of organization exist, often in an attempt to address the shortcomings of the radical method, but are less common. For instance, it is common for a dictionary ordered principally by the Kangxi radicals to have an auxiliary index by pronunciation, expressed typically in eitherpinyin orbopomofo.[58] This index points to the page in the main dictionary where the desired character can be found. Other methods use only the structure of the characters, such as thefour-corner method, in which characters are indexed according to the kinds of strokes located nearest the four corners (hence the name of the method),[59] or theCangjie method, in which characters are broken down into a set of 24 basic components.[60] Neither the four-corner method nor the Cangjie method requires the user to identify the proper radical, although many strokes or components have alternate forms, which must be memorized in order to use these methods effectively.
The availability of computerized Chinese dictionaries now makes it possible to look characters up by any of the indexing schemes described, thereby shortening the search process.
Chinese characters do not reliably indicate their pronunciation. Therefore, many transliteration systems have been developed to write the sounds of different varieties of Chinese. While many use theLatin alphabet, systems using theCyrillic andPerso-Arabic alphabets have also been designed. Among other purposes, these systems are used by students learning the corresponding varieties. The replacement of Chinese characters with a phonetic writing system was first prominently proposed during the May Fourth Movement, partly motivated by a desire to increase the country's literacy rate. The idea gained further support following the victory of the Communists in 1949, who immediately began two parallel programs regarding written Chinese. The first was the development of an alphabet to write the sounds of Mandarin, the variety spoken by around two-thirds of the Chinese population.[45] The other program investigated the simplification of the standard character forms. Initially, character simplification was not competing with the idea of a phonetic script; rather, simplification was intended to make the transition to a fully phonetic writing system easier.[5]
By 1958, official priorities had shifted towards character simplification. TheHanyu Pinyin (or simply 'pinyin') alphabet had been developed, but plans to replace Chinese characters with it were deferred, and the idea is no longer actively pursued. This change in priorities may have been due in part to pinyin's design being specific to Mandarin, to the exclusion of other dialects.[61]
Pinyin uses the Latin alphabet withdiacritics to represent the phonology of Standard Chinese. For the most part, pinyin uses phonetic values for letters that reflect their existing pronunciations in Romance languages and theInternational Phonetic Alphabet (IPA). However, pairs of letters such asb andp that correspond to avoicing distinction in languages such as French instead represent theaspiration distinction that is more abundant in Mandarin.[45] Pinyin also uses several consonantal letters to represent markedly different sounds from their assignments in other languages. For example, pinyinq andx correspond to sounds similar to Englishch andsh, respectively. While pinyin has become the predominant transliteration system for Mandarin, others includebopomofo,Wade–Giles,Yale,EFEO andGwoyeu Romatzyh.[62]
Official Taiwanese documents can no longer be written from right to left or from top to bottom in a new law passed by the country's parliament