Recent from talks
Nothing was collected or created yet.
Four-corner method
View on WikipediaThis article needs additional citations for verification. (August 2009) |

The four-corner method or four-corner system (simplified Chinese: 四角号码检字法; traditional Chinese: 四角號碼檢字法; pinyin: sì jiǎo hàomǎ jiǎnzì fǎ; lit. 'four corner code lookup-character method') is a character-input method used for encoding Chinese characters into either a computer or a manual typewriter, using four or five numerical digits per character.
The four digits encode the shapes found in the four corners of the symbol, upper left to lower right. Although this does not uniquely identify a Chinese character, it leaves only a very short list of possibilities. A fifth digit can be added to describe an extra part above the lower right if necessary.
The four-corner method, in its three revisions, was supported by the Chinese state for a while, and is found in numerous older reference works and some still in publication. The small Kangorin Sino-Japanese Dictionary by Yoneyama had a four-corner index when it was introduced in the 1980s, but it has since been deleted. However, it is not in common usage today, although dictionaries using it are available. It is identified, in public opinion, with the time when many Chinese were illiterate and the language was not yet unified; more Chinese today use the dictionary to help them write, not read. But it is useful for scholars, clerks, editors, compilers, and especially for foreigners who read Chinese. In recent years it has achieved a new usage as a character input system for computers, generating very short lists to browse.
Origin
[edit]The four-corner method was invented in the 1920s by Wang Yunwu, the editor in chief at Commercial Press Ltd., China. Its original purpose was to aid telegraphers in looking up Chinese telegraph code numbers in use at that time from long lists of characters. This was mentioned by Wang Yunwu in an introductory pamphlet called Four-Corner Method, published in 1926. Cai Yuanpei and Hu Shih wrote introductory essays for this pamphlet.
Mnemonics
[edit]The four digits used to encode each character are chosen according to the "shape" of the four corners of each character.[clarification needed] In order, these corners are upper left, upper right, lower left and lower right. The shapes can be memorized using a poem composed by Hu Shih, called Bihuahaoma Ge (筆畫號碼歌; Bǐhuà hàomǎ gē; 'stroke number song'), as a "memory key" to the system:
| Traditional | Simplified | Pinyin | Meaning | |||
|---|---|---|---|---|---|---|
|
一橫二垂三點捺, |
一横二垂三点捺, |
Yī héng, èr chuí, sān diǎn, nà; |
1 for horizontal, 2 vertical, 3 is a dot; |
In the 1950s, lexicographers in the People's Republic of China changed the poem somewhat in order to avoid association with Hu Shih, who had criticized the Chinese Communist Party, although the contents remain generally unchanged. The 1950s version is as follows:
| Traditional | Simplified | Pinyin | Meaning | |||
|---|---|---|---|---|---|---|
|
横一垂二三點捺, |
横一垂二三点捺, |
Héng yī, chuí èr, sān diǎn, nà; |
horizontal is 1, vertical 2, 3 is a dot; |
Several other notes:
- A single stroke can be represented in more than one corner, as is the case with many curly strokes. (e.g. the code for 乙 is 1771)
- If the character is fenced by 囗, 門 (门), or 鬥, the lower corners are used to denote what is inside the radical, instead of 00 for 囗 or 22 for the others. (e.g. the code for 回 is 6060)
There have been scores, maybe hundreds, of such numerical and alpha-numerical systems proposed or popularized (such as Lin Yutang's "Instant Index", Trindex, Head-tail, Wang An's Sanjiahaoma, Halpern); some Chinese refer to these generically as sijiaohaoma (after the original pamphlet) though this is not correct.
Versions
[edit]Over time, the four-corner method has gone through some changes.
First Version
[edit]The first (revised) version was published in Shanghai in 1928. It was quickly adopted and popularized as a method for (among other things):
- Arranging and indexing Chinese characters in dictionaries
- Indexing Chinese classical and modern books, libraries, hospital and police records
- Chinese typewriters
- Military code making (for handling the characters quickly)
The Wang Yun-wu Da Cidian of 1928 was remarkable for its time, and although the pronunciations were very much in line with today's Standard Chinese, the lack of a phonetic index diminished its overall usefulness. The northern Mandarin pronunciations were given in Gwoyeu Romatzyh, a romanization system devised by linguist Zhao Yuanren, as well as in Mandarin Phonetic System (MPS or Bopomofo) characters with a dotted corner for tone. It also delineated parts of speech, and all compounds were listed by the four-corner method as well.
The famed lexicographer and editor of Ciyuan, Lu Erkui, as well as other lexicographers, became early proponents of the four-corner method. By 1931, it was used extensively by the Commercial Press to index virtually all classical reference works and collections of China, such as the Pei Wen Yun Fu and Complete Library of the Four Treasuries, as well as many modern ones.
Hospital, personnel and police records were organized just like the biographical indexes and dynastic histories of former times. For a while (Nash, Trindex, 1930), it seemed that use of the 214 Kangxi radicals, introduced during the Qing dynasty, was being replaced by the four-corner method.
Internationally, Harvard and other universities were using the method for their book collections, and the KMT government in Nanjing seemed to have selected this numerical system as its standard. It was taught in primary schools to children in Shanghai and other locations during the late 1920s and throughout the 1930s, up to the outbreak of war with Japan in 1937. The four-corner method was extremely popular in government education circles to promote spoken language unification until pronunciation-based systems became fashionable in the mid-1930s.
The first large-scale project to promote spoken language unification was in 1936: Wang Li's 4-volume Mandarin Phonetic System entry, Guoyu Cidian. In 1949 it was re-edited into the MPS Hanyu Da Cidian with Kangxi radical index, and a small Four Corner dictionary was available as the Xin Sijiaohaoma Cidian of 1953. After 1949, limited use of MPS and the original four-corner method continued in the People's Republic of China, until the introduction of pinyin in 1958 and after. Today's Chinese dictionaries still contain MPS characters below each pinyin class entry and sometimes in a phonetics chart in tables (Xinhua Cidian), while main entries are all in Hanyu Pinyin order. There is one all-sijiaohaoma small dictionary (Third Revision, below).
Second Revision
[edit]A minor Second Revision was made during and just after World War II. This was used by most postwar lexicographers including Morohashi Tetsuji, who created his 12-volume Sino-Japanese dictionary, the Dai Kan-Wa jiten and included the four-corner index among several other lookup methods. Oshanin included a four-corner index in his Chinese-Russian dictionary and an edition of the 25 Histories (Ershiwu shi) was published in the early 1950s with a four-corner index volume containing the entire content.
Then, in 1958, with the introduction of pinyin, a small Xin Sijiaohaoma Cidian was produced by the Beijing Commercial Press, but the rapid Han character simplification of the following years made the small (30,000 compound) book obsolete in China. Overseas and in Hong Kong, it remained popular for a number of years as a high speed key to phonetic dictionaries and indexes. It was used by those partly literate in or unfamiliar with Standard Chinese, especially Hanyu Pinyin.
Wang Yunwu produced a Xiao Cidian and Zonghe Cidian in the late 1940s; the latter remains in print in Taiwan, with an auxiliary section of rare characters and gives the telecode number, radical and stroke counts for each character.
Third Revision
[edit]During the Cultural Revolution in mainland China, the four-corner method underwent a radical Third Revision during the compilation of the experimental volume of the Xiandai Hanyu Cidian, Commercial Press, Beijing, 1972. Another medium-sized dictionary, the Xinhua Zidian, appeared with this index as well, but in the late 1990s the four-corner index disappeared from newer editions. Both works now use only the pinyin main entry and multi-door radical index systems that make it possible to look up a character with perhaps a wrong radical (i.e., characters appear redundantly under different radicals) and the number of strokes and variant forms are greatly reduced, and many more people are literate and capable of transcribing Chinese with pinyin. The use of stroke counting and radicals puts memorization of the character ahead of sheer speed in handling it. This method is more supportive of mass literacy than classical scholarship or processing and filing names or characters for the majority in China today.
The four-corner method is ultimately for readers, researchers, editors and fileclerks, not for writers who seek a character that they know in speech or recitation. In China today, a new version of the excellent small Xin Sijiaohaoma Cidian, soft cover from Commercial Press, Beijing, has been available since the late 1970s, updated in several new editions and printings. It also uses the Third Revision.
Current usage
[edit]The main purpose of the original four-corner system today is in doing academic research or handling large numbers of characters, terms, index cards, or names. It is also used in computer entry, where a smaller list of items is created to browse from than with other systems. The Xinhua Zidian large type edition is available with a four-corner index for those whose failing eyesight precludes browsing and counting strokes.
In China today, many famous KMT period reference books and collections with four-corner indexes are being reprinted for sale to scholars and those interested in Old Chinese language or historical studies.[citation needed]
See also
[edit]Context
[edit]Uses
[edit]- CKC Chinese Input System, implementation of the four-corner method
- Chinese four-corner index, listing of many Chinese characters sorted by four-corner number
Other structural encodings
[edit]- Chinese telegraph code, a 4-digit system
- SKIP, a structural system for Japanese kanji
- Cangjie method
References
[edit]
External links
[edit]- Four Corner System for Searching Character Indexes and Computer Input explained
- MDBG online Chinese-English dictionary, a dictionary with four corner code support
- An overview of the four corner coding system
- "Information about Wang Yun-Wu". 2009-12-05. Archived from the original on 2012-12-06.
- VimIM 四角號碼中文輸入法
- Graphical four-corner lookup for WWWJDIC.
Four-corner method
View on GrokipediaHistorical Development
Invention and Early Adoption
Wang Yunwu, while serving as editor-in-chief at the Commercial Press in Shanghai, developed the four-corner method in 1928 to address the shortcomings of radical-based indexing, which often required subjective identification of character radicals and limited accessibility for users unfamiliar with etymological structures.[6] The system assigns a four-digit code based on the stroke shapes in the upper-left, upper-right, lower-left, and lower-right corners of a character, enabling a mechanical, stroke-oriented classification that prioritized universality over historical radicals.[1] This innovation stemmed from Wang's efforts to streamline dictionary production and user lookup in an era of expanding print media and literacy campaigns.[7] The method first appeared in Commercial Press dictionaries that year, including revisions to comprehensive character compilations, where it replaced or supplemented traditional methods to expedite indexing of over 40,000 characters.[6] It received formal endorsement from the Republic of China's Ministry of Education in 1928, reflecting governmental support for modernizing lexicographical tools amid broader reforms in education and publishing.[8] Initial adoption spread within Republican China's printing houses and educational institutions, particularly in Shanghai's commercial publishing sector, where it facilitated faster reference in school texts and reference works during the 1930s, though penetration remained uneven due to entrenched familiarity with radical systems. By the mid-1930s, Commercial Press had integrated it into multiple dictionary editions, aiding educators and typesetters in handling the complexities of simplified printing workflows.[7]Versions and Revisions
The Four-corner method originated in its basic form in the early 1920s, with Wang Yunwu proposing an initial classification system based on stroke shapes in the four corners of Chinese characters, assigning numerical codes from 0 to 9 without a dedicated fifth code or extensive mnemonic aids for ambiguities.[9] This version emphasized straightforward encoding of the top-left, top-right, bottom-left, and bottom-right corners, suitable for dictionary indexing but limited in handling overlapped or obscured strokes in denser characters.[10] Over the subsequent three to four years, Wang conducted approximately seventy minor revisions and four major modifications, refining classification rules for stroke identification and introducing mechanisms to address encoding conflicts, culminating in the formalized 1927 publication that established the method's core framework for practical use.[9] A key enhancement in these revisions was the addition of an "attached corner" or fifth code, positioned above the fourth corner to disambiguate cases where primary corners were insufficient, building on earlier ideas from Gao Mengdan's 1925 work on numerical character inspection.[10] [2] Mid-20th-century updates further codified rules for consistency, including standardized stroke categorizations to reduce subjective interpretations in complex glyphs, as documented in revised dictionaries and indexing manuals from the Commercial Press.[9] Following the 1949 establishment of the People's Republic of China, the method persisted in Taiwan with adaptations for traditional characters, maintaining Wang's original encodings, while mainland applications diminished amid the promotion of simplified characters, incorporating modified mnemonics by Huang Weirong to align with evolving orthographic standards.[11]Methodological Principles
Core Encoding Rules
The Four-Corner Method encodes Chinese characters by partitioning each into four quadrants—top-left, top-right, bottom-left, and bottom-right—and assigning a digit from 0 to 9 to each based on the predominant stroke shape or ending in that corner, prioritizing the visual geometry of the outermost strokes for consistent identification independent of radical decomposition.[1] This approach relies on empirical observation of stroke terminations rather than etymological or phonetic components, ensuring lookup reliability through geometric pattern matching.[1] The standardized set comprises ten symbols corresponding to common stroke configurations:| Digit | Description |
|---|---|
| 0 | Top-like elements (e.g., 亠), enclosed full upper or lower sections, or absence of distinct strokes |
| 1 | Horizontal strokes or those tending rightward |
| 2 | Vertical strokes or those tending leftward |
| 3 | Dots or slanting strokes to the lower right |
| 4 | Crossed strokes (vertical with diagonal) |
| 5 | Multiply crossed or intersecting lines |
| 6 | Box-like enclosures |
| 7 | Angular corners or bends |
| 8 | Diverging bifurcations (two branches) |
| 9 | Trifurcating or three-way divergences |
Mnemonics and Symbolic Codes
The four-corner method employs pictographic mnemonics to associate numerical codes with basic stroke shapes, facilitating memorization by linking abstract numbers to visual resemblances in common character components. For instance, code 8 is assigned to shapes resembling the character 八 (bā, "eight"), characterized by intersecting slanting strokes, while code 9 corresponds to forms akin to 小 (xiǎo, "small"), often featuring a short vertical or dot-like element. These symbolic mappings draw from the empirical observation that stroke motifs recur predictably across the over 50,000 Chinese characters cataloged in comprehensive dictionaries, reducing cognitive load by standardizing recognition of corner elements derived from a limited set of approximately 24 stroke types.[1] A traditional aide-mémoire in the system is a mnemonic poem that encapsulates the core stroke-to-number assignments, promoting rapid recall during indexing or lookup. One such verse, documented in reference works on the method, recites: "Horizontal is 1, hanging is 2, and 3 stands for dots and slants; crosses 4, a stroke more 5, and boxes number 6; 7 corners, 8 like 八 'eight', and 9 like 小 'small'; dot above a horizontal stroke is 0 in the front." This rhythmic structure leverages phonetic and visual cues, such as the slanting form of 八 mirroring code 8's diagonal motifs, to encode the system's 10 primary codes (0 through 9), which cover graphical primitives like horizontals (1), verticals (2), dots (3), and enclosures (6).[1] In certain variants, particularly for computational input, numeric codes transition to alphanumeric representations to enhance precision and compatibility with keyboard layouts, where letters substitute for numbers to denote stroke classes (e.g., mapping shapes to keys like 'A' for horizontals). This adaptation preserves the mnemonic foundation while accommodating mechanical constraints, as the underlying symbolic associations remain tied to recurring stroke patterns empirically validated in character corpora. Such devices underscore the method's design for human usability, prioritizing intuitive recall over rote enumeration.[12]Step-by-Step Encoding Process
The four-corner method encodes a Chinese character by analyzing its graphical structure in a fixed sequence, assigning a digit from 0 to 9 to each of the four corners based on the predominant stroke shape or component present, resulting in a four-digit code that facilitates lookup or input.[1] This process prioritizes the visible form over etymological or phonetic elements, ensuring replicability for any standard printed or handwritten character.[13] To begin, divide the character into quadrants corresponding to the upper-left, upper-right, lower-left, and lower-right corners, examining them in that order from top to bottom and left to right.[1] For each corner, identify the key stroke or structural element—such as a horizontal line, vertical stroke, dot, or enclosure—and match it to one of the predefined shape categories. These categories are standardized as follows:| Code | Representative Shapes and Elements |
|---|---|
| 0 | Lid (亠), full upper/lower enclosure, or absence of distinct strokes; also dots or horizontals in some variants.[1][13] |
| 1 | Horizontal strokes or elements extending rightward.[1] |
| 2 | Vertical strokes or elements tending leftward, including hooks.[1][13] |
| 3 | Dots or slanting strokes to the lower right.[1][13] |
| 4 | Crosses formed by vertical and diagonal lines.[1][13] |
| 5 | Skewered or double-crossed lines intersecting multiple elements.[1][13] |
| 6 | Box or square enclosures (e.g., 囗).[1][13] |
| 7 | Angular corners or knock-like structures.[1] |
| 8 | Diverging pairs, such as slashes or eight-like forms.[1][13] |
| 9 | Triple-diverging or small-enclosed structures (e.g., 小).[1][13] |