Written Chinese
From Wikipedia, the free encyclopedia
Image:Western Zhou Ritual Containers3.jpg
Left: Bronze 方樽 fāngzūn ritual wine container dated about 1000 BCE. The written inscription cast in bronze on the vessel commemorates a gift of cowrie shells (then used as currency in China) from someone of presumably elite status in 周 Zhōu Dynasty society. Right: Bronze 方彝 fāngyí ritual container dated about 1000 BCE. A written inscription of some 180 Chinese characters appears twice on the vessel. The written inscription comments on state rituals that accompanied court ceremony, recorded by an official scribe.
Written Chinese comprises the written symbols used to represent spoken Chinese and the rules about how they are arranged and punctuated. These symbols are commonly known as Chinese characters (traditional/simplified Chinese: 漢字/汉字; pīnyīn: hànzì), many of which have been traced back to the 商 Shāng Dynasty about 1500 BCE. The process of creating characters probably began some centuries earlier.[1] Over the millennia, these characters have evolved into well-developed styles of Chinese calligraphy.[2] Chinese characters were standardized under the 秦 Qín dynasty (221–206 BCE), to reflect the spoken languages and dialects of the capital city 長安/长安 Cháng'ān (modern day 西安 Xī'ān).[3] Despite historical changes in pronunciation, these characters have remained nearly constant, and Chinese speakers in disparate dialect groups can communicate in writing.[4] Educated Chinese know about 4,000 characters.[5][6] Some Chinese characters have also been adopted as part of the writing system in other East Asian languages, such as Japanese and Korean.[7][8] Chinese characters do not constitute an alphabet or a compact syllabary; they are instead built up from simpler parts representing objects or abstract notions,[9] although most characters contain some indication of their pronunciation.[10] The great number of Chinese characters has given rise to the adoption of Western alphabets as an alternative for representing Chinese.[11]
Role of Chinese charactersImage:PICT7466.JPG
Tomb of Fu Hao, c. 1200 BC, containing some 200 bronze vessels with 109 inscriptions in oracle bone script of Fu Hao's name.[12]
Written Chinese developed to represent spoken Chinese. At the inception of written Chinese, spoken Chinese was a monosyllabic language; that is, Chinese words represented independent concepts (objects, actions, relations, and so forth) that were generally only one syllable in the spoken language.[13] The Chinese language has since diversified into many dialects and these dialects have become polysyllabic. As a result, many old syllables no longer stand on their own, in the same way that pre- (Latin prefix meaning "earlier") cannot typically be used on its own as an English word.[14] However, because the meanings of modern Chinese words can usually be analyzed in terms of the old Chinese syllables that constitute them, written Chinese has been continuously used to represent individual Chinese syllables.[15] Each of these syllables represents a morpheme, or semantic unit, so written Chinese is generally (though not universally) considered to be logographic; at least one scholar considers it a large, inefficient phonetic script.[16] Chinese dialects vary not only by pronunciation, but to a lesser degree also vocabulary and syntax, so a single written Chinese standard cannot represent all dialects equally well. Modern written Chinese, which became the written standard as an indirect result of the May Fourth Movement of 1919, is not technically bound to any single dialect; however, it most nearly represents the vocabulary and syntax of Mandarin, by far the most widespread Chinese dialect in terms of both geographical area and number of speakers.[17] This version of written Chinese is called Vernacular Chinese, or 白話/白话 báihuà (literally, "clear tongue").[18] Before the development of Vernacular Chinese, the prevailing written standard was a vocabulary and syntax rooted in Chinese as spoken around the time of Confucius (about 500 BCE), called Classical Chinese, or 文言 wényán. Over the centuries, Classical Chinese gradually acquired features from various dialects. This accretion was generally slow and minor, so that just before it was supplanted by Vernacular Chinese, Classical Chinese was distinctly different from any contemporary dialect.[19] Classical Chinese retained much of the vocabulary and syntax of the two-millennia-old version of spoken Chinese it was derived from, so it was taught separately from any native dialect.[20] Once learned, however, it was a common medium for communication between people speaking different dialects—dialects that often came to be mutually unintelligible by the end of the first millennium CE.[21] A Mandarin speaker might say yī, a Cantonese yat, and a Hokkienese tsit, but all three will understand the character 一 "one".[4] Despite its ties to the dominant Mandarin dialect, Vernacular Chinese serves the same function to a degree, limited by the fact that Vernacular Chinese expressions are often ungrammatical or unidiomatic in many of the non-Mandarin dialects. This role may not differ substantially from the role of other lingua francas such as Latin: For those trained in written Chinese, it serves as a common medium; for those untrained in it, the graphic nature of the characters is in general no aid to common understanding (characters such as "one" notwithstanding).[22] The variation in vocabulary among dialects has also led to the informal use of "dialectal characters", as well as standard characters that are nevertheless considered archaic by today's standards.[23] Cantonese is unique among non-Mandarin regional languages in having a written colloquial standard, used in Hong Kong and overseas, with a large number of unofficial characters for words particular to this dialect.[24] Written colloquial Cantonese has become quite popular in online chat rooms and instant messaging, although for formal written communications Cantonese speakers still normally use standard written Chinese.[25] Chinese characters in other languagesChinese characters were first introduced into Japanese sometime in the first half of the first millennium CE, probably from Chinese products imported into Japan.[7] At the time, Japanese had no native written system, and Chinese characters were used for the most part to represent Japanese words with the corresponding meanings, rather than similar pronunciations. A notable exception to this rule was the system of man'yōgana, which used a small set of Chinese characters to help indicate pronunciation. The man'yōgana later developed into the phonetic alphabets, hiragana and katakana.[26] The Chinese characters imported into Japanese were called hànzì, after the 漢/汉 Hàn Dynasty of China; in Japanese, this was pronounced kanji. In modern written Japanese, kanji are used for nouns, verb stems, and adjective stems, while the hiragana are used for prefixes and suffixes. The katakana are used exclusively for sound symbols, and for loans from other languages. The Jōyō Kanji, a list of kanji for common use standardized by the Japanese government, contains 1,945 characters—about half the number of characters commanded by literate Chinese.[8] The role of Chinese characters in Korean and Vietnamese, in contrast, is much more limited. At one time, many Chinese characters (called hanja, a term cognate to both hànzì and kanji) were introduced into Korean for their meaning, just as in Japanese.[8] Now, written Korean relies almost exclusively on the phonetic hangul script, in which each syllable is written with two or three phonetic symbols that combine to form a single character. Similarly, the use of Chinese and Chinese-styled characters in the Vietnamese chữ nôm script has been almost entirely superseded by the quốc ngữ alphabet.[27] Structure of Chinese charactersWritten Chinese is the only major modern writing system not based predominantly on an alphabet or a compact syllabary. Instead, Chinese characters are glyphs whose parts may depict objects or represent abstract notions. These parts may occasionally stand alone as independent characters; more usually, they are combined, using a variety of different principles, to form more complex characters. The best known exposition of Chinese character composition is the 說文解字/说文解字 Shuōwén Jiězì, compiled by 許慎/许慎 Xǚ Shèn around 120 CE. Since Xǚ Shèn did not have access to Chinese characters in their earliest forms, his analysis, based as it is on somewhat later forms, cannot be taken as authoritative.[28] Nonetheless, no later work has supplanted the Shuōwén Jiězì in terms of breadth, so it remains the most accessible source for non-specialists, via its various redactions.[9] According to the Shuōwén Jiězì, Chinese characters are developed on six basic principles.[29] (These principles, though popularized by the Shuōwén Jiězì, were developed earlier; the oldest known mention of them is in the 周禮/周礼 Zhōulǐ—literally, "Rites of Zhou"—a text from about 150 BCE.[30]) The first two principles produce simple characters, known as 文 wén:[29]
The remaining four principles produce complex characters historically called 字 zì (although this term is now generally used to refer to all characters, whether simple or complex). Of these four, two construct characters from simpler parts:[29]
In contrast to the popular conception of Chinese as a primarily pictographic or ideographic language, the vast majority of Chinese characters (about 95 percent of the characters in the Shuōwén Jiězì) are constructed as either logical aggregates or, more often, phonetic complexes.[10] In fact, some phonetic complexes were originally simple pictographs that were later augmented by the addition of a semantic root. An example is 炷 zhù "candle", which was originally a pictograph 主, a character that is now pronounced zhǔ and means "host". The character 火 huǒ "fire" was added to indicate that the meaning is fire-related.[31] The last two principles do not produce new written forms; instead, they transfer new meanings to existing forms:[29]
Chinese characters are generally written to fit into a square (except for simple characters such as 一 yī "one" for which this is not possible), even when they are composed of two simpler forms written side by side or top to bottom. In such cases, each form is compressed appropriately so that the entire character continues to fit into a square.[32] Whenever writers of the Chinese encounter a new concept or object, they combine characters to signal the new object. For instance, when the Chinese discovered giraffes, they used the word cháng jǐng lù (長頸鹿/长颈鹿), meaning "long neck deer," as the name for a giraffe.[33] Chinese written formsAlthough most Chinese characters have a canonical form, there is considerable variation in how they are written or printed on a page, a variation that goes beyond the familiar notion of typeface or font for alphabetic languages. Today, there are five recognized written traditions for Chinese writing style:[2]
Regular script is considered the archetype for Chinese writing, and forms the basis for most printed forms. In addition, regular script imposes a stroke order, which must be followed in order for the characters to be written correctly.[34] (Strictly speaking, this stroke order applies to the clerical, running, and grass scripts as well, but especially in the running and grass scripts, this order is occasionally deviated from.) Thus, for instance, the character 木 mù "wood" must be written starting with the horizontal stroke, drawn from left to right; next, the vertical stroke, from top to bottom, with a small hook toward the upper left at the end; next, the left diagonal stroke, from top to bottom; and lastly the right diagonal stroke, from top to bottom.[35] Earlier forms
Replica of an ancient Chinese oracle bone.
The seal script, although the earliest surviving form of Chinese writing, does not represent the embryonic stage of Chinese writing. The first indisputable examples of Chinese writing, dating back to the Shāng Dynasty in the latter half of the second millennium BCE, were the oracle bones (primarily ox scapulae and turtle shells), used for divination. Characters were inscribed on the bones in order to frame a query; the bones were then heated over a fire, and the resulting cracks were interpreted to determine the answer to the query. Such characters are called 甲骨文 jiǎgǔwén "shell-bone script" or oracle bone script.[1] After the Shāng Dynasty, Chinese writing evolved into the form found on bronzeware made during the Western 周 Zhōu Dynasty (c 1066–770 BCE) and the Spring and Autumn Period (770–476 BCE), a kind of writing called 金文 jīnwén "metal script". Jīnwén characters are more regular and angular than the embellished script of the oracle bone script. Later, in the Warring States Period (475–221 BCE), the script became still more regular, and settled on a form, called 六國文字/六国文字 liùguó wénzì "script of the six states", that Xǔ Shèn used as source material in the Shuōwén Jiězì. These characters were later embellished and stylized to yield the seal script characters, which in turn evolved into the other surviving writing styles.[1] In 2003, tentative evidence was found at 賈湖/贾湖 Jiǎhú, an archaeological site in the 河南 Hénán province of China, for a still earlier form of Chinese writing. Some symbols were found that bear striking resemblance to certain modern characters, such as 目 mù "eye". Since the Jiǎhú site dates from about 7000 to 5800 BCE, it predates the earliest confirmed Chinese writing by well over 3,000 years. The nature of this finding—whether it represents true writing (that is, a general mechanism for expression) or simply proto-writing (which comprises a limited set of symbols)—is still disputed. Critics contend that if the Jiǎhú finding really represented a direct ancestor of modern Chinese writing, it would indicate that Chinese writing remained relatively static for three millennia, at a time when China was sparsely populated.[36] Simplified and traditional ChineseIn the 20th century, written Chinese divided into two canonical forms, called 簡體字/简体字 jiǎntǐzì (simplified Chinese) and 繁體字/繁体字 fántǐzì (traditional Chinese). Simplified Chinese was developed in the People's Republic of China (mainland China) in order to make the characters faster to write (especially as some characters had as many as a few dozen strokes) and easier to memorize. The People's Republic of China has claimed that both goals have been achieved, but some external observers disagree. Little systematic study has been conducted on how simplified Chinese has affected the way Chinese people become literate; the only studies conducted before it was standardized in mainland China seem to have been statistical ones regarding how many strokes were saved on average in samples of running text.[37]
Traditional and simplified Chinese versions of the Chinese word hànzì.
The simplified forms have also been criticized for being inconsistent. For instance, traditional 讓 ràng "allow" is simplified to 让, in which the phonetic on the right side is reduced from 17 strokes to just three. (The speech radical on the left has also been simplified.) However, the same phonetic is used in its full form, even in simplified Chinese, in such characters as 壤 rǎng "soil" and 齉 nàng "snuffle"; these forms remained uncontracted because they were relatively uncommon and would therefore represent a negligible stroke reduction.[38] On the other hand, some simplified forms are simply calligraphic abbreviations of long standing, as for example 万 wàn "ten thousand", for which the traditional Chinese form is 萬.[39] Simplified Chinese is standard in the People's Republic of China, Singapore, and Malaysia. Traditional Chinese is retained in Hong Kong, the Republic of China (Taiwan), and Macau.[40] Throughout this article, Chinese text is given in both simplified and traditional forms when they differ, with the traditional forms being given first. Layout of written ChineseChinese characters conform to a roughly square frame and are not usually linked to one another, so they could be written in any direction in a square grid. Traditionally, Chinese is written in vertical columns from top to bottom; the first column is on the right side of the page, and the text runs toward the left. Text written in Classical Chinese also uses little or no punctuation. In such cases, sentence and phrase breaks are determined by context and rhythm.[41] In modern times, the familiar Western layout of horizontal rows from left to right, read from the top of the page to the bottom, has become more popular, especially in the People's Republic of China, with the rise of Vernacular Chinese; the government of the People's Republic of Chinese mandated left-to-right writing in 1955.[42] Punctuation has also become more prevalent, whether the text is written in columns or rows. The punctuation marks are clearly influenced by their Western counterparts, although some marks are particular to Chinese: for example, the double and single quotation marks (『 』 and 「 」); the hollow period (。), which is otherwise used just like an ordinary full stop; and a special kind of comma called an enumeration comma (、), which is used to separate items in a list, as opposed to clauses in a sentence. Signs are often a particularly challenging aspect of written Chinese layout, since they can be written either left to right or right to left (the latter can be thought of as the traditional layout with each "column" being one character high), as well as from top to bottom. It is not unusual to encounter all three orientations on signs on neighboring stores.[43] However, in 2004, Taiwan mandated a Western, left-to-right layout of Chinese for most texts (excluding arts and literature).[44] LiteracyBecause the majority of modern Chinese words contain more than one character, there are at least two measuring sticks for Chinese literacy: the number of characters known, and the number of words known. John DeFrancis, in the introduction to his Advanced Chinese Reader, suggests that a typical Chinese college graduate recognizes perhaps 4,000 to 5,000 characters, and 40,000 to 60,000 words.[5] Jerry Norman, in Chinese, places the number of characters somewhat lower, at 3,000 to 4,000.[6] These counts are complicated by the tangled development of Chinese characters. In many cases, a single character came to be written in multiple ways, as with English "color/colour". This latter development was stemmed to an extent during the Qín dynasty, when 李斯 Lǐ Sī promulgated the seal script as the standard throughout the newly unified Chinese empire,[3] but soon started again. Although the Shuōwén Jiězì lists 10,516 characters—9,353 of them unique (some of which may already have been out of use by the time it was compiled) plus 1,163 graphic variants—the 集韻/集韵 Jíyùn of the Northern 宋 Sòng Dynasty, compiled in 1039, contains no fewer than 53,525 characters, most of them graphic variants.[45] Chinese dictionariesChinese is not based on an alphabet or syllabary, so Chinese dictionaries cannot be straightforwardly lexically ordered, as English dictionaries are, for instance. The need to arrange Chinese characters in order to permit efficient lookup has given rise to a considerable variety of ways to organize and index the characters.[46] A traditional mechanism is the method of radicals, which uses a set of character roots. These roots, or radicals, generally but imperfectly align with the parts used to compose characters by means of logical aggregation and phonetic complex. A canonical set of 214 radicals was developed during the rule of the 康熙 Kāngxī emperor (around the year 1700); these are sometimes called the Kāngxī radicals. The radicals are ordered first by stroke count (that is, the number of strokes required to write the radical); within a given stroke count, the radicals also have a prescribed order.[47] Every Chinese character falls under the heading of exactly one of these 214 radicals.[46] In many cases, the radicals are themselves characters, which naturally come first under their own heading. All other characters under a given radical are ordered by the stroke count of the character. Usually, however, even this level of division leads to numerous characters with a given stroke count under a given radical. At this point, characters are not given in any recognizable order; the user must locate the character by going through all the characters with that stroke count, typically listed for convenience at the top of the page on which they occur.[48] The advantage of this method is that one need not know how to pronounce a character before looking it up; the entry, once located, usually gives the pronunciation. A disadvantage is that which of the various roots of a character is the proper radical is not always immediately obvious. Accordingly, dictionaries often include a list of hard to locate characters, indexed by total stroke count, near the beginning of the dictionary.[46] Other methods of organization exist, often in an attempt to address the shortcomings of the radical method, but are less common. An exhaustive list is not possible; however, a selection follows:
| |||||||||||||||||||||||||||||||||||||||||


