
| Chinese characters |
|---|
Collation and standards
|
Homographs and readings |
TheChinese family of scripts includes writing systems used to write various East Asian languages, that ultimately descend from theoracle bone script invented in theYellow River valley during theShang dynasty. These includewritten Chinese itself, as well as adaptations of it for other languages, such as Japanesekanji, Koreanhanja, Vietnamesechữ Hán andchữ Nôm,Zhuangsawndip, and Baibowen. More divergent are theTangut script,Khitan large script,Khitan small script and its offspring, theJurchen script, as well as theYi script,Sui script, andGeba syllabary, which were inspired by written Chinese but not descended directly from it. While written Chinese and many of its descendant scripts arelogographic, others are phonetic, including thekana,Nüshu, andLisu syllabaries, as well as thebopomofo semi-syllabary.[1]
These scripts are written invarious styles, principallyseal script,clerical script,regular script,semi-cursive script, andcursive script. Adaptations range from the conservative, as in Korean, which used Chinese characters in their standard form with only a few local coinages, and relatively conservative Japanese, which has coined a few hundred new characters and used traditional character forms until the mid-20th century, to the extensive adaptations of Zhuang and Vietnamese, each coining over 10,000 new characters by Chinese formation principles, to the highly divergentTangut script, which formed over 5,000 new characters by its own principles.

The earliest known examples of Chinese writing areoracle bone inscriptions madec. 1200 BC at Yin (near modernAnyang), the site of the final capital of theShang dynasty (c. 1600 – c. 1046 BC). These inscriptions were carved into oxscapulae and tortoiseplastrons, and recorded the results of official divinations conducted by the Shang royal house.[2] The script shows extensive simplification and linearization, believed by most researchers to indicate an extensive development of the script prior to the oldest samples.[3] Whilevarious symbols inscribed on pieces of pottery, jade, and bone have been found at Neolithic sites across China, there is no clear evidence of any relation to Shang oracle bone script.[4]Inscriptions on bronze vessels using a developed form of the Shang script dating toc. 1100 BC have also been discovered, and have provided a richer corpus.[5]
Each character of the early script represents anOld Chinese word, which were uniformly monosyllabic at that time.[3] Characters are traditionally classified according to a system of six categories (六書;liùshū; 'six writings') according to the apparent strategy used to create them. This system was first made popular by theShuowen Jiezi dictionary (c. 100 AD). Three of these categories involved a representation of the meaning of the word:
Characters directly descendant of these forms remain still among the most commonly used today.[6]
Words that could not be represented pictorially, such as abstract terms and grammatical particles, were denoted using therebus strategy, selecting characters for similar-sounding words. Thus, these phonetic loans (假借字;jiǎjièzì) are new uses of existing characters rather than new graphic forms.[7] An example is來;lái; 'come', written with the character for a similar-sounding word meaning 'wheat'.[8] The borrowed character was sometimes modified slightly to distinguish it from the original, as with毋;wú; 'do not', a borrowing of母;mǔ; 'mother'.[9]
Phono-semantic compounds (形聲字;xíngshēngzì) were obtained by adding semantic indicators to disambiguate phonetic loans. This type was already used extensively on the oracle bones, and has been the main source of new characters since then. For example, the character其 originally representingjī; 'winnowing basket' was also used to write the pronoun and modal particleqí. Later the less common original word was written with the compound箕, obtained by adding the symbol竹;zhú; 'bamboo' to the character.[10] Sometimes the original phonetic similarity has been obscured by millennia ofsound change, as in格;gé <*krak 'go to' and路;lù <*graks 'road'.[11] Many characters often explained as semantic compounds were originally phono-semantic compounds that have been obscured in this way. Some authors even dispute the validity of the semantic compound category.[12]
The sixth traditional category (轉注字;zhuǎnzhùzì) contains very few characters; its meaning is uncertain.[13]
Development and simplification of the script continued during theWestern Zhou andSpring and Autumn periods, with characters becoming less pictorial and more linear and regular, with rounded strokes being replaced by sharp angles. Writing became more widespread during theWarring States period, as well as further simplified and more varied, particularly in the eastern states. After the western state ofQin unified China, its more conservativeseal script became the standard across the entire country.[14] A simplified form known asclerical script became the standard during theHan dynasty, and later evolved intoregular script, which remains in use.[15]At the same time,semi-cursive andcursive scripts developed.[16]
Thetraditional Chinese script is currently used in Taiwan, Hong Kong and Macau. Mainland China and Singapore use thesimplified Chinese variant.
Until the early 20th century, formal writing employedLiterary Chinese, based on the vocabulary and syntax of theChinese classics. The script was also used less formally to record local varieties, which had over time diverged from the classical language and each other. The logographic script easily accommodated differences in pronunciation, meaning and word order, but often new characters were required for words that could not be related to older forms. Many such characters were created using the traditional methods, particularly phono-semantic compounds.[17]
For many centuries, the Chinese script was the only writing system in East Asia, and had a huge influence as the vehicle for the dominance of Chinese culture. Korea, Japan and Vietnamadopted Chinese literary culture as a whole. For many centuries, all writing in neighbouring countries was inLiterary Chinese, albeit influenced by the writer's native language. Although they wrote in Chinese, writing about local subjects required characters to represent names of local people and places; leading to the creation of Han characters specific to other languages, some of which were later re-imported as Chinese characters. Later they sought to use the script to write their own languages. Chinese characters were adapted to represent the words of other languages using a range of strategies, including
The principle of representing one monosyllabic word with one character was readily applied to neighbouring languages to the south with a similaranalytic structure to Chinese, such asVietnamese andZhuang. The script was a poorer fit for the polysyllabicagglutinative languages of the north-east, such asKorean,Japanese and theMongolic andTungusic languages.[18]


Chinese characters adapted to writeJapanese words are known askanji. Chinese words borrowed into Japanese could be written with the Chinese character, while Japanese words could be written using the character for a Chinese word of similar meaning. Because there have been multiple layers of borrowing into Japanese, a single kanji may have severalreadings in Japanese.[19]
Other systems, known askana, used Chinese characters phonetically to transcribe the sounds of Japanese syllables. An early system of this type wasman'yōgana, as used in the 8th-century anthologyMan'yōshū. This system was not quite asyllabary, because each Japanese syllable could be represented by one of several characters, but from it were derived two syllabaries still in use today. They differ because they sometimes selected different characters for a syllable, and because they used different strategies to reduce these characters for easy writing: the angularkatakana were obtained by selecting a part of each character, whilehiragana were derived from the cursive forms of whole characters. Such classic works asLady Murasaki'sThe Tale of Genji were written in hiragana, the only system permitted to women of the time.[20]
Modern Japanese writing employs a composite script system consisting of kanji, hiragana, and katakana. Kanji are used primarily for content words such as nouns, verb stems, and adjective stems. Hiragana is employed for grammatical elements, such as inflectional endings and function words, as well as for some native Japanese words that lack commonly used kanji or whose kanji are considered obscure or difficult to write. Katakana is typically used to transcribe foreign (non-Sino-Japanese) loanwords, represent onomatopoeic expressions, render colloquial or informal terms, indicate emphasis (similar to italics or capitalization in English), and provide alternative spellings for certain words.[21] A few hundred characters have been coined in Japan; these are known askokuji, and include natural phenomena, particularly fish, such as鰯; 'sardine', together with everyday terms such as働; 'work' and technical terms such as腺; 'gland'.
Chinese characters adapted to write Korean are known asHanja. From the 9th century,Korean was written using a number of systems collectively known asIdu, in which Hanja were used to write both Sino-Korean and native Korean roots, and a smaller number of Hanja were used to write Korean grammatical morphemes with similar sounds. The overlapping uses of Hanja made the system complex and difficult to use, even when reduced forms for grammatical morphemes were introduced with theGugyeol system in the 13th and 14th centuries.[22]
TheHangul alphabet introduced in the 15th century was much simpler, and specifically designed for the sounds of Korean. The alphabet makes systematic use of modifiers corresponding to features of Korean sounds. Although Hangul is unrelated to Chinese characters, its letters are written in syllabic blocks that can be interspersed with Hanja. Such aKorean mixed script became the usual way of writing the language, with roots of Chinese origin denoted by Hanja and all other elements rendered in Hangul. Hanja is still used (but not very commonly like the Japanese) and is required in both North and South Korea.[23]
Historically, a few characters known asgukja were coined in Korea; one example is畓 'rice paddy'.

Vietnamese was first written from the 13th century using thechữ Nôm script based on Chinese characters, but the system developed in a quite different way than in Korea or Japan. Vietnamese was and is a strongly analytic language with many distinct syllables (roughly 4,800 in the modern standard language), so there was little motivation to develop a syllabary. As with Korean and Japanese, characters were used to write borrowed Chinese words, native words with a similar sound and native words with a similar meaning. In the Vietnamese case, the latter category consisted mainly of early loans from Chinese that had come to be accepted as native. The Vietnamese system also involved creation of new characters using Chinese principles, but on a far greater scale than in Korea or Japan. The resulting system was highly complex and was never mastered by more than 5% of the population. It was completely replaced in the 20th century by the Latin-basedVietnamese alphabet.[24][25]
Zhuang has been written usingSawndip for over a thousand years. The script uses both Chinese characters and new characters formed using the traditional methods, as well as some formed by combining pairs of characters to indicate the pronunciation of a word by thefanqie method.[26] The number of new created characters is similar in scale to thechữ Nôm of Vietnam. Even though an official alphabet-based writing system for Zhuang was introduced in 1957, Sawndip is still more often used in less formal situations.[27]
Several peoples in southwest China recorded laws, songs and other religious and cultural texts by representing words of their languages using a mix of Chinese characters with a similar sound or meaning, or pairs of Chinese characters indicating pronunciation using thefanqie method. The languages so recorded includedMiao,Yao,[28]Bouyei,[29]Kam,[30]Bai[31] andHani.[1] All these languages are now written using Latin-based scripts.
Chinese characters were also used to transcribe theMongolian text ofThe Secret History of the Mongols.

Between the 10th and 13th centuries, northern China was ruled by foreign dynasties that created scripts for their own languages.TheKhitan large script andKhitan small script, which in turn influenced theTangut script andJurchen script, used characters that superficially resemble Chinese characters, but with the exception of a few loans were constructed using quite different principles.In particular the Khitan small script contained phonetic sub-elements arranged in a square block in a manner similar to the more sophisticated Hangul system devised later for Korean.[32]
Other scripts in China that borrowed or adapted some Chinese characters but are otherwise distinct includeBa–Shu scripts[33]Geba script,Sui script,Yi script and theLisu syllabary.[1]