Recent from talks
Nothing was collected or created yet.
Taishanese
View on Wikipedia| Taishanese | |
|---|---|
| Native to | China, overseas communities particularly in United States and Canada |
| Region | Sze Yup, the Pearl River Delta; United States: historic Chinese communities in Chinatown, San Francisco, other parts of the San Francisco Bay Area of California such as in the San Jose and Oakland areas, Chinatown, Boston and nearby Quincy, Massachusetts, and New York City, Seattle, Washington; Canada: Montreal, Toronto and Vancouver |
| Language codes | |
| ISO 639-3 | – |
| ISO 639-6 | tisa |
| Glottolog | tois1237 |
| Linguasphere | 79-AAA-mbc |
| Taishanese | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Traditional Chinese | 臺山話 | ||||||||||||||||
| Simplified Chinese | 台山话 | ||||||||||||||||
| |||||||||||||||||
Taishanese (simplified Chinese: 台山话; traditional Chinese: 臺山話; pinyin: Táishān huà; Jyutping: toi4 saan1 waa2), alternatively romanized in Cantonese as Toishanese or Toisanese, in local dialect as Hoisanese or Hoisanwa, is a Yue Chinese language native to Taishan, Guangdong.
Even though they are related, Taishanese has little mutual intelligibility with Cantonese. It is not a dialect of Cantonese. Taishanese is also spoken throughout Sze Yup (or Siyi in the pinyin romanization of Standard Mandarin Chinese), located on the western fringe of the Pearl River Delta in Guangdong, China. In the late 19th century and early 20th century, most of the Chinese emigration to North America originated in Sze Yup (which includes Taishan).[1] Thus, up to the mid-20th century, Taishanese was the dominant variety of the Chinese language spoken in Chinatowns in Canada and the United States. It was formerly the lingua franca of the overseas Chinese residing in the United States.[2]
Names
[edit]The earliest linguistic studies refer to the dialect of Llin-nen or Xinning (traditional Chinese: 新寧; simplified Chinese: 新宁).[3] Xinning was renamed Taishan in 1914, and linguistic literature has since generally referred to the local dialect as the Taishan dialect, a term based on the pinyin romanization of Standard Mandarin Chinese pronunciation.[4][5][6][7][8][9] Alternative names have also been used. The term Toishan is a convention used by the United States Postal Service,[10] the Defense Language Institute[11] and the 2000 United States census.[12] The terms Toishan, Toisan, and Toisaan are all based on Cantonese pronunciation and are also frequently found in linguistic and non-linguistic literature.[13][14][15][16] Hoisan is a term based on the local pronunciation, although it is not generally used in published literature.[17]
These terms have also been anglicized with the suffix -ese: Taishanese, Toishanese, and Toisanese. Of the previous three terms, Taishanese is most commonly used in academic literature, to about the same extent as the term Taishan dialect.[18][19] The terms Hoisanese and Hoisan-wa[20] do appear in print literature, although they are used more on the internet.[21][22]
Another term used is Sìyì (Sze Yup or Seiyap in Cantonese romanization; Chinese: 四邑; lit. 'four counties'). Sìyì or Sze Yup refers to a previous administrative division in the Pearl River Delta consisting of the four counties of Taishan, Kaiping, Enping and Xinhui. In 1983, a fifth county (Heshan) was added to the Jiangmen prefecture; so whereas the term Sìyì has become an anachronism, the older term Sze Yup remains in current use in overseas Chinese communities where it is their ancestral home. The term Wǔyì (Chinese: 五邑), literally "five counties", refers to the modern administrative region, but this term is not used to refer to Taishanese.
History
[edit]Taishanese originates in the Taishan region, where it is spoken. Taishanese can also be seen as a group of very closely related, mutually intelligible dialects spoken in the various towns and villages in and around the Siyi region (literally the 'Four Counties' of Toishan, Hoiping, Yanping, Sunwui, transcribed from Standard Cantonese; the names Taishan, Kaiping, Enping and Xinhui, as above, are romanized from Standard Mandarin using Pinyin).
Although this area started undergoing sinicization from the late Han dynasty, Xinhui was decreed as a district during the Northern and Southern dynasties, whilst Enping was established in 622 during the Tang dynasty. Taishan itself was split from Xinhui in 1499, during the Ming dynasty, whilst Kaiping was established in 1649 during the Qing dynasty from territory formerly under Xinhui, Enping, and Xinxing.[23] Thus, as a branch of Yue Chinese, Taishanese is derived from Middle Chinese. Within Siyi, Taishanese proper is closest to the dialect of Kaiping, both phonologically and lexically. It also bears phonological resemblance to the speech of Heshan, a later addition to the region.[23]
A vast number of Taishanese immigrants journeyed worldwide through the Taishan diaspora. The Taishan region was a major source of Chinese immigrants through continental Americas from the late-19th to mid-20th centuries. Taishanese was the predominant dialect spoken by the 19th-century Chinese builders of railroads in North America.[24] Approximately 1.3 million people are estimated to have origins in Taishan.[25] Prior to the signing of the Immigration and Naturalization Act of 1965, which allowed new waves of Chinese immigrants,[26] Taishanese was the dominant dialect spoken in Chinatowns across North America.[20]
As of 2015[update] Taishanese is still spoken in many Chinatowns throughout North America, including those of San Francisco,[27] Oakland, Los Angeles, New York City, Boston, Vancouver, Toronto, Chicago, and Montreal by older generations of Chinese immigrants and their children, but is today being supplanted by mainstream Cantonese and increasingly by Mandarin in both older and newer Chinese communities alike, across the continent.[citation needed]
Relationship with Cantonese
[edit]Taishanese is a language of the Yue branch of Chinese, which also includes Cantonese. However, due to ambiguities in the meaning of "Cantonese" in the English language, as it can refer to both the greater Yue dialect group or its prestige standard (Standard Cantonese), "Taishanese" and "Cantonese" are commonly used in mutually exclusive contexts, i.e. Taishanese is treated separately from "Cantonese". Despite the closeness of the two, they are hardly mutually intelligible.[28][29][30]
The phonology of Taishanese bears a lot of resemblance to Cantonese, since both of them are part of the same Yue branch. Like other Yue dialects, such as the Goulou dialects, Taishanese pronunciation and vocabulary may sometimes differ greatly from Cantonese. Although Taishan stands only 60 miles (100 km) from the city of Guangzhou, they are separated by numerous rivers, and the dialect of Taishan is among the most linguistically distant Yue dialects from the Guangzhou dialect.[31]
Standard Cantonese functions as a lingua franca in Guangdong province, and speakers of other Chinese varieties (such as Chaozhou, Minnan, Hakka) living in Guangdong may also speak Cantonese. On the other hand, Standard Mandarin Chinese is the standard language of the People's Republic of China and the only legally allowed medium for teaching in schools throughout most of the country (except in minority areas), so residents of Taishan speak Mandarin as well. Although the Chinese government has been making great efforts to popularize Mandarin by administrative means, most Taishan residents do not speak Mandarin in their daily lives, but treat it as a second language, with Cantonese being the lingua franca of their region.[citation needed]
Phonology
[edit]Initial consonants
[edit]There are 19 to 23 initials consonants (or onsets) in Taishanese, which is shown in the chart below in IPA:
| Labial | Dental/ Alveolar |
Palatal | Velar | Glottal | ||||
|---|---|---|---|---|---|---|---|---|
| plain | sibilant | plain | ||||||
| Nasal | m1 | n1 | ŋ1 | |||||
| Stop | prenasal | ᵐb1 | ⁿd1 | ᵑɡ1 | ||||
| plain | p | t | t͡s2 | t͡ɕ2 | k | ʔ | ||
| aspirated | pʰ | tʰ | t͡sʰ2 | t͡ɕʰ2 | kʰ | |||
| Fricative | voiceless | f | ɬ | s2 | ɕ2 | h | ||
| voiced | v | ʒ3 | ||||||
| Approximant | l | j3,4 | w5 | |||||
- The respective nasal onsets (/m/, /n/, and /ŋ/) are allophones of the pre-nasalized voiced stop onsets (/ᵐb/, /ⁿd/, and /ᵑɡ/). The velar nasal (ŋ) sound occurs in both syllable initial and syllable final positions. There is a tendency toward denasalization for initial /ŋ/ as in 耳 /ŋi/ [ŋɡi] 'ear', 飲/饮 /ŋim/ [ŋɡim] 'to drink',魚 /ŋuy/ [ŋɡui] 'fish' and 月 /ŋut/ [ŋɡut] 'moon'. In words like 牙 /ŋa/ 'tooth' and 我 /ŋoy/ 'I; me', denasalization does not seem to take place. In syllable final position following the rounded vowel [o], /ŋ/ is usually modified by lip-rounding. Examples are: 東 /uŋ/ 'east' and 紅 /huŋ/ 'red'.
- The palatal sibilants (/t͡ɕ/, /t͡ɕʰ/, and /ɕ/) are allophones of the respective alveolar sibilants (/t͡s/, /t͡sʰ/, and /s/) when the first vowel of the final consonant is high (/i/ and /u/).
- The palatal approximant (/j/) is an allophone of the voiced fricative sibilant initial (/ʒ/).
- The palatal approximant (/j/) can be a semivowel of the vowel /i/ when used as a glide.
- The labial-velar approximant (/w/) can be a semivowel of the vowel /u/ when used as a glide.
Vowels
[edit]There are about seven different vowels in Taishanese:
| Front | Central | Back | |
|---|---|---|---|
| Close | /i/1 | /u/2 | |
| Close-Mid | /e/ | /ə/3 | |
| Open-Mid | /ɛ/ | /ɔ/ | |
| Open | /a/ |
- The closed front vowel (/i/) can be a palatal approximant ([j]) as a semivowel.
- The closed back vowel (/u/) can be a labial-velar approximant ([w]) as a semivowel.
- The rounding of the schwa /ə/ is variable.
Final consonants
[edit]The final consonant (or rime) occurs after the initial sound, which consists of a medial, a nucleus, and a coda. There are three medial (or glides) in Taishanese that occur after the initial sound: null or no medial, /i/, or /u/. There are five main vowels after the medial: /a/, /e/, /i/, /u/, and null or no vowel. There are nine main codas at the end of the final: null or no coda, /i/, /u/, /m/, /n/, /ŋ/, /p/, /t/, and /k/.
| Nucleus | -a- | -e- | -ɵ~ə- | -i- | -u- | -∅- | |||
|---|---|---|---|---|---|---|---|---|---|
| Medial | ∅- | i- | u- | ∅- | ∅ | ∅- | ∅- | ∅- | |
| Coda | -∅ | [a] | [iɛ] | [uɔ] | [i] | [u] | |||
| -i | [ai] | [uɔi] | [ei] | [ui] | |||||
| -u | [au] | [iau] | [eu] | [iu] | |||||
| -m | [am] | [iam] | [em] | [im] | [m] | ||||
| -n | [an] | [uɔn] | [en] | [in] | [un] | ||||
| -ŋ | [aŋ] | [iaŋ] | [ɔŋ] | [ɵŋ] ~ [əŋ] | |||||
| -p | [ap] | [iap] | [ep] | [ip] | |||||
| -t | [at] | [uɔt] | [et] | [ɵt] ~ [ət] | [it] | [ut] | |||
| -k | [ak] | [iak] | [ɔk] | [ɵk] ~ [ək] | |||||
Tones
[edit]Taishanese is tonal. There are five contrastive lexical tones: high, mid, low, mid falling, and low falling.[5] In at least one Taishanese dialect, the two falling tones have merged into a low falling tone.[32] There is no tone sandhi.[10]
| Tone | Tone contour[33] | Example | Changed tone | Chao Number | Jyutping tone number[citation needed] |
|---|---|---|---|---|---|
| high (yin shang) | ˥ (55) | hau˥ 口 (mouth) | (none) | - | 2 |
| mid (yin ping) | ˧ (33) | hau˧ 偷 (to steal) | mid rising | ˧˥ (35) | 1 |
| low (yang ping) | ˨ or ˩ (22 or 11) | hau˨ 頭 (head) | low rising | ˨˥ (25) | 4 |
| mid falling | ˧˩ (31) | hau˧˩ 皓 (bright) | mid dipping | ˧˨˥ (325) | 6 |
| low falling (yang shang) | ˨˩ (21) | hau˨˩ 厚 (thick) | low dipping | ˨˩˥ (215) | 5 |
Taishanese has four changed tones: mid rising, low rising, mid dipping and low dipping. These tones are called changed tones because they are the product of morphological processes (e.g. pluralization of pronouns) on four of the lexical tones. These tones have been analyzed as the addition of a high floating tone to the end of the mid, low, mid falling and low falling tones.[8][32][34][35] The high endpoint of the changed tone often reaches an even higher pitch than the level high tone; this fact has led to the proposal of an expanded number of pitch levels for Taishanese tones.[5] The changed tone can change the meaning of a word, and this distinguishes the changed tones from tone sandhi, which does not change a word's meaning.[4] An example of a changed tone contrast is 刷 /tʃat˧/ (to brush) and 刷 /tʃat˨˩˥/ (a brush).
| Tone name | Level píng 平 |
Rising shǎng 上 |
Departing qù 去 |
Entering rù 入 | |
|---|---|---|---|---|---|
| Upper yīn 陰 |
高 | ˧ (33) | ˥ (55) | ˧ (33) | ˥ (5) |
| 低 | ˧ (3) | ||||
| Lower yáng 陽 |
高 | ˨ or ˩ (22 or 11) | ˨˩ (21) | ˧˨ or ˧˩ (32 or 31) | ˧˨ or ˧˩ (32 or 31) |
| 低 | ˨˩ (21) | ||||
Writing system
[edit]The writing system is Chinese. Historically, the common written language of Classical Literary Chinese united and facilitated cross-dialect exchange in dynastic China, as opposed to the spoken dialects which were too different to be mutually intelligible. In the 20th century, standard written Chinese, based on Mandarin, was codified as the new written standard. As Taishanese is primarily used in speech, characters needed specifically for writing Taishanese are not standardized and may vary. Commonly seen alternatives are shown below.
The sound represented by the IPA symbol ⟨ɬ⟩ (the voiceless alveolar lateral fricative) is particularly challenging, as it has no standard romanization. The digraph "lh" used above to represent this sound is used in Totonac, Chickasaw and Choctaw, which are among several written representations in the languages that include the sound. The alternative "hl" is used in Xhosa and Zulu, while "ll" is used in Welsh. Other written forms occur as well.
The following chart compares the personal pronouns among Taishanese, Cantonese, and Mandarin. In Taishanese, the plural forms of the pronouns are formed by changing the tone,[36] whereas in Cantonese and Mandarin, a plural marker (地/哋/等 dei6 and 们/們 men, respectively) is added.
| Singular | Plural | |||||
|---|---|---|---|---|---|---|
| Taishanese | Standard Cantonese |
Mandarin | Taishanese | Standard Cantonese |
Mandarin | |
| 1st person |
我 ngöi [ŋɔɪ˧] |
我 ngo5 |
我 wǒ |
哦/偔/呆 ngo̖i [ŋɔɪ˨˩] |
我 ngo5 哋 dei6 |
我们/我們 wǒmen |
| 2nd person |
你 nï [nɪ˧] |
你 nei5 |
你 nǐ |
偌/逽/聶 nie̖k [nɪɛk˨˩] |
你 nei5 哋 dei6 |
你们/你們 nǐmen |
| 3rd person |
佢 küi [kʰuɪ˧] |
佢 keoi5 |
他 tā |
㑢/𠳞/佉/劇 kie̖k [kʰɪɛk˨˩] |
佢 keoi5 哋 dei6 |
他们/他們 tāmen |
See also
[edit]References
[edit]- Anderson, Stephen R. (1978), "Tone features", in Fromkin, Victoria A. (ed.), Tone: A Linguistic Survey, New York, NY: Academic Press
- Bauer, Robert S.; Benedict, Paul K. (1997), Modern Cantonese Phonology, Berlin: Walter de Gruyter
- Chao, Yuen-Ren (1951), "Taishan Yuliao", Bulletin of the Institute of Historical Philology (Academia Sinica), 23: 25–76
- Chen, Matthew Y. (2000), Tone Sandhi: Patterns Across Chinese Dialects, Cambridge, UK: Cambridge University Press
- Cheng, Teresa M. (1973), "The Phonology of Taishan", Journal of Chinese Linguistics, 1 (2): 256–322
- Chung, L. A. (2007), "Chung: Chinese 'peasant' dialect redeemed", San Jose Mercury News, San Jose, CA
- Defense Language Institute (1964), Chinese-Cantonese (Toishan) Basic Course, Washington, DC: Defense Language Institute
- Don, Alexander (1882), "The Lin-nen variation of Chinese", China Review: 236–247
- Grimes, John A. (1996). A Concise Dictionary of Indian Philosophy: Sanskrit Terms Defined in English. State University of New York Press. ISBN 0791430677.
- Him, Kam Tak (1980), "Semantic-Tonal Processes in Cantonese, Taishanese, Bobai and Siamese", Journal of Chinese Linguistics, 8 (2): 205–240
- Hom, Marlon Kau (1983), "Some Cantonese Folksongs on the American Experience", Western Folklore, 42 (2), Western Folklore, Vol. 42, No. 2: 126–139, doi:10.2307/1499969, JSTOR 1499969
- Hom, Marlon Kau (1987), Songs of Gold Mountain: Cantonese Rhymes from San Francisco, Berkeley, CA: University of California Press
- Hsu, Madeline Y. (2000), Dreaming of Gold, Dreaming of Home: Transnationalism and Migration between the United States and China, 1882-1943, Palo Alto, CA: Stanford University Press
- Ladefoged, Peter; Maddieson, Ian (1996), The Sounds of the World's Languages, Blackwell Publishing, p. 203, ISBN 0-631-19815-6
- Lee, Gina (1987), "A Study of Toishan F0", Ohio State University Working Papers in Linguistics, 36: 16–30
- Leung, Genevieve Yuek-Ling (2012), Hoisan-wa reclaimed: Chinese American language maintenance and language ideology in historical and contemporary sociolinguistic perspective, Philadelphia, PA: University of Pennsylvania, pp. 1–237 (Ph.D. Dissertation)
- Light, Timothy (1986), "Toishan Affixal Aspects", in McCoy, John; Light, Timothy (eds.), Contributions to Sino-Tibetan Studies, Leiden, Netherlands: Brill, pp. 415–425
- Ma, Laurence; Cartier, Carolyn L., eds. (2003), The Chinese Diaspora: Space, Place, Mobility, and Identity, Rowman & Littlefield, p. 57, ISBN 0-7425-1756-X
- McCoy, John (1966), Szeyap Data for a First Approximation of Proto-Cantonese, Ithaca, NY: Cornell University (Ph.D. Dissertation)
- Ramsey, S. Robert (1987), The Languages of China, Princeton University Press, pp. 23–104, ISBN 0-691-06694-9
- Pulleyblank, Edwin (1984), Middle Chinese: A Study in Historical Phonology, UBC Press, p. 31, ISBN 0-7748-0192-1
- Szeto, Cecilia (2000), "Testing intelligibility among Sinitic dialects" (PDF), Proceedings of ALS2K, the 2000 Conference of the Australian Linguistic Society, retrieved 2008-09-06
- Wong, Maurice Kuen-shing (1982), Tone Change in Cantonese, Champaign, IL: University of Illinois at Urbana-Champaign
- Yang, Fenggang (1999), Chinese Christians in America: Conversion, Assimilation, and Adhesive Identities, Penn State Press, p. 39
- Yip, Moira (2002), Tone, Cambridge, UK: Cambridge University Press
- Yiu, T'ung (1946), The T'ai-Shan Dialect, Princeton, NJ: Princeton University (Ph.D. Dissertation)
- Yu, Alan (2007), "Understanding near mergers: The case of morphological tone in Cantonese", Phonology, 24 (1): 187–214, doi:10.1017/S0952675707001157, S2CID 18090490
- Yue-Hashimoto 余, Anne O. 霭芹 (2005), The Dancun Dialect of Taishan 台山淡村方言研究, Language Information Sciences Research Centre, City University of Hong Kong, ISBN 962-442-279-6
- Notes
- ^ Peter Kwong and Dusanka Miscevic (2005). Chinese America: the untold story of America's oldest new community. The New Press. ISBN 978-1-56584-962-4.
- ^ (Yang 1999)
- ^ (Don 1882)
- ^ a b (Chen 2000)
- ^ a b c (Cheng 1973)
- ^ Cantonese speakers have been shown to understand only about 31% of what they hear in Taishanese (Szeto 2000)
- ^ (Yiu 1946)
- ^ a b (Yu 2007)
- ^ (Anderson 1978)
- ^ a b (Lee 1987)
- ^ (Defense Language Institute 1964)
- ^ "Language code list" (PDF). United States Census, 2000. University of Michigan Library. Archived from the original (PDF) on June 2, 2008.
- ^ (Hom 1983)
- ^ (Light 1986)
- ^ (McCoy 1966)
- ^ (Hom 1987)
- ^ (Grimes 1996)
- ^ (Him 1980)
- ^ (Hsu 2000)
- ^ a b (Leung 2012)
- ^ Taishan (Hoisanese Sanctuary) Archived 2011-07-19 at the Wayback Machine from asianworld.pftq.com
- ^ (Chung 2007)
- ^ a b Tan, Yutian (2017). "Classifying Siyi Cantonese Using Quantitative Approaches". The Ohio State University ProQuest Dissertations & Theses. Retrieved 7 August 2025.
- ^ Chan, Josie (2008). "Who Built the Canadian Pacific Railway? Chinese Workers from Hoisan". S2CID 162063234.
- ^ "Taishan International Web". Archived from the original on June 10, 2008.
- ^ Although the Chinese Exclusion Act was repealed by the signing of the Magnuson Act in 1943, immigration from China was still limited to only 2% of the number of Chinese already living in the United States (Hsu 2000)
- ^ Lucas, Scott (April 21, 2015). "Chinatown Decoded: What Language Is Everybody Speaking?". San Francisco. Archived from the original on November 16, 2016.
- ^ Szeto, Cecilia (2001), "Testing intelligibility among Sinitic dialects" (PDF), in Allan, Keith; Henderson, John (eds.), Proceedings of ALS2k, the 2000 Conference of the Australian Linguistic Society, retrieved 5 Jan 2014
- ^ Phonology of Cantonese - Page 192 Oi-kan Yue Hashimoto - 1972 "... affricates and aspirated stops into consonant clusters is for external comparative purposes, because the Cantonese aspirated stops correspond to /h/ and some of the Cantonese affricates correspond to stops in many Siyi (Seiyap) dialects."
- ^ Language in the USA - Page 217 Charles A. Ferguson, Shirley Brice Heath, David Hwang - 1981 "Even the kind of Cantonese which the Chinese Americans speak causes difficulties, because most of them have come from the rural Seiyap districts southwest of Canton and speak dialects of that region rather than the Standard Cantonese of the city"
- ^ Ramsey 1987, p. 23.
- ^ a b (Wong 1982)
- ^ Chao's tone numbers are generally used in the literature. Each tone has two numbers, the first denotes the pitch level at the beginning of the tone, and the second denotes the pitch level at the end of the tone. Cheng modified the numerical range from 1 (lowest) to 7 (highest): high tone as 66, mid tone as 44, and low tone as 22. In this article Chao's tone letters are used, as they've been adopted by the IPA.
- ^ (Bauer & Benedict 1997)
- ^ (Yip 2002)
- ^ Ramsey 1987, p. 104.
External links
[edit]- Stephen Li. "Taishanese Language Resources". Retrieved 2015-01-05. Taishanese Resources Website
- Stephen Li. "Toisanese Chop Suey 台山话杂碎". Retrieved 2015-01-05. Taishanese Language Blog
- Aaron Lee. "Four Counties 四邑". Retrieved 2015-01-05. Taishanese Language Blog
- C.J. Chow. "Learn Taishanese (台山話)". Archived from the original on 2014-12-17. Retrieved 2015-01-05. You can download the Defense Language Institute's 'Chinese-Cantonese (Toishan) Basic Course' audio and text material here
- Chinese Character to Taishanese Lookup tool Archived 2016-10-19 at the Wayback Machine
- Gene M. Chin. "Hoisanva Sites" Archived 2019-11-09 at the Wayback Machine. Alphabetical Dictionary and Lessons.
Taishanese
View on GrokipediaNames and classification
Names
Taishanese, also known as Taishanhua (台山话) in Mandarin, derives its name from Taishan County in Guangdong Province, China, where it is primarily spoken; the county was renamed from Xinning in 1914, leading to the adoption of "Taishan dialect" in linguistic literature thereafter.[1] The term "Taishanese" reflects the pinyin romanization of Taishan, emphasizing its geographic origin within the Siyi (四邑) region, historically known as Sze Yap or Seiyap in Cantonese.[1][4] Alternative names include Toisanese, a Cantonese-based romanization used in early 20th-century English sources such as U.S. government documents, and Hoisanese, which approximates the local pronunciation; these terms appeared in print as early as the 19th century among overseas communities.[5][4] The broader designation "Sze-yap dialect" historically referred to the varieties spoken across the four counties of Siyi—Taishan, Kaiping, Enping, and Xinhui—encompassing what is now often specified as Taishanese for the Taishan variant.[1][4] Speakers distinguish between self-designations like Hoisanwa (or Hoisan-wa), meaning "Hoisan language," which reflects the endonymic pronunciation of Taishan as Hoisan, and exonyms imposed by outsiders, such as Toisanese in Cantonese-influenced contexts or Taishanese in Mandarin-oriented scholarship.[4][5] Naming conventions evolved significantly from the 19th century onward, tied to waves of immigration from the Siyi region to North America; up to two-thirds of Chinese immigrants to the United States between the 1850s and 1930s originated from this area, leading to the prominence of terms like Toishanese or Hoisan-wa in American English to denote the dominant variety spoken in Chinatowns.[4] This usage persisted in diaspora communities, where the language served as a lingua franca, even as broader labels like "Cantonese dialect" sometimes overshadowed its specificity in later scholarship.[1][4]Linguistic classification
Taishanese is a variety of Yue Chinese, belonging to the Sinitic branch of the Sino-Tibetan language family, and is specifically classified within the Siyi (also known as Sze-yap or Ng Yap) subgroup of Yue.[6] This subgroup encompasses dialects spoken in the four counties of Taishan, Kaiping, Enping, and Xinhui in western Guangdong province, China, distinguishing it from the Guangfu subgroup that includes Standard Cantonese spoken in Guangzhou.[6] As part of Yue, Taishanese shares genetic relations with other dialects in the family, such as those in the Eastern (Guangfu and Fenglian) and Western (Yongxun) branches, marked by common innovations including the presence of sonorant initials in both yin and yang tone registers and the split of the entering (ru) tone into high and low registers conditioned by vocalic features.[6] Linguists debate whether Taishanese constitutes a dialect of Cantonese or a separate language, primarily based on criteria like mutual intelligibility and shared innovations versus retentions. While Taishanese is often grouped under the broader "Cantonese" umbrella due to its Yue affiliation, it exhibits limited mutual intelligibility with Standard Cantonese, with speakers able to recognize some vocabulary but struggling with connected speech owing to phonological and lexical divergences.[1] Some scholars propose elevating the Siyi dialects, including Taishanese, to a distinct branch within Sinitic languages, citing unique features like prenasalized stops (e.g., /mb/, /nd/, /ŋg/) and tone mergers (e.g., upper even tones merging with lower departing tones) that diverge from core Yue patterns, though this view challenges the traditional Yue classification.[7] Within Taishanese, several subdialects exist, reflecting local variations across towns in the Siyi region, such as those from Doushan and Shuibu in Taishan county. These subdialects show phonological differences, including variations in tone contours, initial consonants, and vowel realizations—for instance, the Dancun subdialect of Taishan features distinct tone splits and nasalization patterns compared to urban Taishanese.[3] Such internal diversity underscores Taishanese's status as a dialect cluster rather than a monolithic variety, with studies highlighting affixal aspects and tone perception as key areas of variation.[3]History and geographic distribution
Historical development
The origins of Taishanese, a dialect of the Yue branch of Sinitic languages, trace back to the ancient Yue peoples who inhabited southern China, particularly in the coastal regions of modern Guangdong province. During the Qin dynasty's conquest in 214 BCE and the subsequent Han dynasty expansion (206 BCE–220 CE), large-scale migrations of Han Chinese into Yue territories facilitated linguistic assimilation, blending indigenous Yue substrates with incoming Sinitic elements derived from Middle Chinese.[8] This process laid the foundation for Yue dialects, including Taishanese, which evolved through ongoing contact and regional isolation in the Siyi (Four Counties) area around Taishan.[8] The mid-19th century marked a pivotal phase in Taishanese's development abroad, driven by economic hardships, natural disasters, and opportunities such as the California Gold Rush (1848–1855), alongside the Taiping Rebellion (1850–1864), a devastating civil war that ravaged Guangdong province and exacerbated conditions in the Siyi region. The rebellion's violence, which claimed millions of lives and disrupted local agriculture and trade, contributed to massive emigration waves from Taishan and surrounding areas to Southeast Asia, North America, and beyond, establishing Taishanese as a key lingua franca in early Chinese diaspora communities.[9] These migrations preserved and propagated the dialect outside China, influencing its lexical and cultural adaptations through contact with host languages. In the early 20th century, Western missionaries in rural Guangdong produced vernacular materials, including romanized scripts and glossaries for local Yue dialects, to support evangelism. These publications aided religious dissemination and contributed to phonological and orthographic documentation, bridging oral traditions with written forms. The Chinese Civil War culminating in 1949 profoundly impacted Taishanese speakers through widespread displacement from mainland China, as communist victory prompted further outflows to Hong Kong, Taiwan, and overseas enclaves. This exodus, involving many from Guangdong's Siyi districts, bolstered Taishanese communities in diaspora settings, where the language served as a marker of identity and facilitated preservation amid pressures from dominant languages like Mandarin and English.[10]Geographic distribution
Taishanese, a variety of Yue Chinese, is primarily spoken in the Sze-yap (Siyi) region of Jiangmen prefecture in Guangdong Province, southern China, with its core homeland centered in Taishan city.[11] The language is native to Taishan, where the local population of approximately 907,000 residents (as of the 2020 census) predominantly uses Taishanese dialects in daily communication.[12] It extends to the surrounding Sze-yap counties of Kaiping (population around 749,000), Enping (approximately 484,000), and Xinhui (about 735,000), forming a contiguous area where Taishanese varieties are spoken by an estimated 2-3 million people collectively (as of 2020).[13][14][15] These dialects exhibit regional variations, such as the Huicheng subdialect in Taishan and distinct phonetic traits in Kaiping and Enping, though mutual intelligibility remains high within the Sze-yap cluster.[1] Taishanese also appears in nearby urban centers like Guangzhou, where migrant workers from the Sze-yap area contribute to its use in informal settings.[16] Beyond its homeland, Taishanese has a significant diaspora resulting from large-scale migrations during the 19th and 20th centuries, driven by economic opportunities in labor-intensive industries abroad.[1] In the United States, over half a million speakers (as of 2014) maintain communities, particularly in historic Chinatowns of San Francisco and New York City, where Taishanese served as the primary Chinese language among early immigrants.[17] Similar patterns exist in Canada, with notable populations in Vancouver and Toronto; Australia, especially in Sydney and Melbourne; and Southeast Asia, including Malaysia, Indonesia, and Vietnam, where Taishanese speakers number in the hundreds of thousands across these regions.[18] Overall, the global Taishanese diaspora includes an estimated 1.3 million people, many of whom continue to use the language in family and community contexts.[19] In mainland China, Taishanese faces increasing endangerment due to government policies promoting Mandarin as the standard language in education, media, and official communication since the 1980s. As of the 2020s, this shift has led to a decline in intergenerational transmission, with fluent speakers now concentrated among the elderly population, particularly in urbanizing areas of the Sze-yap region where younger generations prioritize Mandarin for socioeconomic mobility, though some revitalization efforts persist in diaspora communities.[1] In rural Taishan and surrounding counties, the language persists more robustly among older adults, but urban migration and Mandarin-dominant schooling have reduced its vitality among those under 50.[20]Phonology
Consonants
Taishanese features an inventory of approximately 19 to 21 initial consonants, which serve as syllable onsets. These include voiceless stops at bilabial (/p/), alveolar (/t/), velar (/k/), and palatal (/tɕ/) places of articulation, each with their aspirated counterparts (/pʰ/, /tʰ/, /kʰ/, /tɕʰ/); nasal stops /m/, /n/, and /ŋ/; fricatives /f/, /s/, and /h/; affricates /ts/ and /tsʰ/; the lateral approximant /l/; glides /w/ and /j/; and the glottal stop /ʔ/.[21] This system reflects the retention of Middle Chinese distinctions in aspiration and place, though Taishanese lacks the voiced stops found in some other Sinitic varieties. Distinctive innovations include a shift of /s/ to [ɬ] (lateral fricative) in certain environments and /tʰ/ to in others.[1] Final consonants, or codas, are restricted but include nasals /m/, /n/, and /ŋ/; approximants /w/ and /j/; and unreleased stops /p̚/, /t̚/, /k̚/ in checked syllables. Unlike Mandarin, which lacks stop codas, Taishanese retains these unreleased stops from Middle Chinese entering tones, resulting in syllables that may close with stops, nasals, or glides.[21] The syllable structure is generally CV(C), where C represents a consonant, V a vowel, and the optional coda limited to nasals, approximants, or unreleased stops; zero-initial syllables (V or V(C)) also occur but often trigger resyllabification processes like gemination of preceding nasals or glides.[2] In contemporary Taishanese speech, a notable merger exists between the alveolar nasal /n/ and lateral /l/ initials, typically realized as or a flap [ɾ] in many dialects, reducing the functional contrast inherited from earlier stages.[21]Vowels and tones
Taishanese possesses a vowel system comprising five primary monophthongs: /i/, /e/, /a/, /o/, and /u/. These are illustrated in lexical items such as dei (/dei/) 'two', gu (/gu/) 'five', moi (/moi/) 'mother', and oi (/oi/) 'love'. Diphthongs include /ei/, as in ìei (/ìei/) 'younger sister'. Nasalized variants are also present, particularly in syllables ending with nasal codas, for example hON (/hɔn/) 'red'.[22] The tone system of Taishanese typically features five contrastive lexical tones in open syllables: high level (55), mid level (33), low level (22), mid falling (32), and low falling (21). Checked syllables, which end in an unreleased stop consonant (/p̚/, /t̚/, /k̚/), exhibit three tones derived from Middle Chinese entering tones. These tones play a crucial role in lexical distinction, with the overall system reflecting eight tonal categories when including checked tones.[23] Taishanese maintains register distinctions inherited from Yue dialects, dividing tones into upper (yin) and lower (yang) registers, which can affect vowel quality and pitch height; for instance, upper register tones tend to be higher in pitch and may associate with tense vowels, while lower register tones are lower and link to lax vowels. Tone sandhi occurs in certain contexts, such as rising changed tones acquiring an additional rising contour before other syllables, though it is less extensive than in some other Yue varieties.[23]Grammar
Syntax
Taishanese follows a canonical Subject-Verb-Object (SVO) word order in simple declarative sentences, consistent with other Yue varieties. For instance, a basic sentence like "Ngoh heung faan" (I eat rice) places the subject "ngoh" (I) before the verb "heung" (eat) and object "faan" (rice). However, like many Sinitic languages, Taishanese permits topic-comment flexibility, where the topic—often a noun phrase—is fronted for emphasis or discourse purposes, followed by a comment clause that predicates information about it. An example is "Faan ngoh heung" (As for rice, I eat it), shifting focus to the object as topic while maintaining underlying SVO relations in the comment. This structure enhances pragmatic coherence but does not alter the core SVO alignment.[24] Nouns in Taishanese typically require classifiers when enumerated, demonstratively modified, or possessed, reflecting the language's classifier system shared with other Chinese varieties. The general classifier "go3" (cognate with Mandarin "ge4") is commonly used for persons or general items, as in "jat1 go3 jan4" (one person), where "jat1" (one) precedes the classifier "go3" before the noun "jan4" (person). Aspect is conveyed through post-verbal markers rather than inflectional changes, underscoring Taishanese's morphological simplicity (detailed further in the morphology section). The perfective aspect, indicating completion, is marked by "a" (or variably "ah" in some subdialects), equivalent to Mandarin "le" and differing from Cantonese's "zo2"; for example, "Ngoh heung a faan" (I have eaten rice) signals the action's boundedness and result.[25][2] Questions in Taishanese are formed using sentence-final particles for yes/no inquiries, with "ma" conveying an enquiring or confirmatory tone. A simple yes/no question appears as "Loi ma?" (Are you coming?), appending "ma" to the verb "loi" (come) without inverting word order. Wh-questions maintain SVO structure with the interrogative in situ, such as "Nei bin go3?" (Who are you?), where "bin" (who) replaces the subject or object. Verb serialization is prevalent, allowing multiple verbs to chain without conjunctions to express complex actions, as in "Ngoh hoi1 maai5" (I go buy), combining motion "hoi1" (go) and purpose "maai5" (buy) in sequence.[26][24] Relative clauses precede the head noun and are linked by the particle "ge2" (pronounced differently from Cantonese's "ge3" in some contexts), functioning as a nominalizer without a relative pronoun. For example, "Jan4 [ngoh jing1 go3] ge2" (the person [that] I know) modifies "jan4" (person) with the embedded clause "ngoh jing1 go3" (I know him/her), contrasting to Cantonese's structurally identical but phonetically distinct "ge3" marker, which may carry a more assertive tone in urban varieties. This pre-nominal positioning aligns with Sinitic typology, enabling compact noun phrases.[24][27]Morphology and word formation
Taishanese, like other Sinitic languages, is predominantly analytic in its morphological structure, relying on word order and particles rather than inflectional affixes to convey grammatical relations. Verbs lack markings for tense, aspect, or agreement, with such categories expressed through contextual elements or aspectual particles instead.[28] Compounding serves as the primary mechanism for word formation in Taishanese, where new words are created by juxtaposing morphemes, often resulting in disyllabic or polysyllabic forms. This reflects the language's preference for combining roots to derive novel meanings. For instance, the compound ziu2 sui3 (swim-water) denotes "to swim," illustrating a verb-object structure typical in Taishanese derivation.[29][30] Reduplication is another productive process in Taishanese morphology, used to indicate iteration, intensity, or diminutives across word classes. For verbs, reduplication often conveys an iterative or delimitative aspect, as in tsein33 tsein33 (weigh-weigh), which implies trying or briefly performing the action of weighing. Adjectival reduplication with tone sandhi expresses gradations of quality, such as hǝŋ22 hǝŋ225 (red-red↑, somewhat red) for mild intensity or hǝŋ225 hǝŋ22 (red↑-red, very red) for strong emphasis. Noun reduplication is more restricted, typically denoting "every" or "each," as seen in ŋit33 ŋit21 (day-day, every day). Diminutives can also arise through partial reduplication, like ǝŋ33 bein553 (cold-ice↓, slightly cold).[31] Derivational affixes in Taishanese are limited and largely borrowed from Classical Chinese, functioning to modify lexical categories or add nuances like familiarity or quantity. Prefixes such as a33 (阿) nominalize or familiarize terms, for example, a33 gɔ55 (a-elder brother, bro). Suffixes like dɔi55 (仔) indicate smallness or youth, as in gi33 dɔi55 (ghost-small, young foreigner). These affixes contribute to word formation but are less pervasive than compounding or reduplication.[32]Vocabulary and lexicon
Core lexicon
The core lexicon of Taishanese, a Yue variety of Sinitic, largely inherits basic terms from Middle Chinese, particularly in domains like numbers, body parts, and kinship, though pronunciations diverge from Standard Mandarin due to regional phonological developments. For instance, cardinal numbers reflect shared Sinitic etymologies but with Yue-specific tones and initials: "one" is pronounced yat¹ (from Middle Chinese ʔit), "two" as ŋi⁵ or ngi², "three" as lɑm², contrasting with Mandarin yī, èr, and sān respectively. These forms underscore Taishanese's retention of ancient features like initial nasals and entering tones absent in northern varieties.[33] Body parts and kinship terms similarly draw from common Sinitic roots, adapted to Taishanese phonology, emphasizing familial hierarchy common across Chinese languages. Examples include "head" as hau⁴, "hand" as siu², and "foot" as kuek⁶, mirroring Cantonese cognates but with distinct tonal contours; for kinship, "mother" is mā¹ and "father" bā¹, with extended terms like "paternal grandfather" as toō⁵foò¹. Such vocabulary highlights the language's conservative preservation of Sino-Tibetan structures for personal and relational concepts.[34][35] Taishanese exhibits unique innovations within the Yue group, particularly in terms for local flora and fauna tied to Guangdong's subtropical environment, diverging from broader Sinitic norms. These terms often blend inherited roots with substrate influences from pre-Sinitic Baiyue languages.[36] Reflecting Taishan's agrarian and coastal heritage, the lexicon in semantic fields of agriculture and fishing features specialized terms that evoke rural livelihoods, such as words for rice cultivation tools or marine life, setting it apart from urban Cantonese varieties. Basic verbs like "eat" (hɛk⁶) differ from Cantonese's sɪk⁶, using a form closer to Mandarin chī but with Yue tonality, while sharing high overlap in core vocabulary such as numerals and body parts but variances in daily actions. This blend of shared inheritance and local adaptation underscores Taishanese's distinct identity within Yue.[35]Influences and loanwords
Taishanese vocabulary exhibits a substratum influence from ancient Baiyue languages, particularly in terms related to agriculture and rice cultivation, reflecting the historical assimilation of non-Sinitic peoples in southern China. For instance, the Taishanese term for glutinous rice, no4, derives from Proto-Tai C.no:w, a feature shared with other Yue varieties due to early contact with Tai-Kadai speaking groups among the Baiyue. This substratum is identified through phonological and lexical correspondences that deviate from standard Sinitic patterns, contributing to unique agricultural lexicon in the dialect.[37] Loanwords from Portuguese entered Taishanese through colonial contacts in nearby Macau, where Portuguese administration from the 16th century facilitated lexical borrowing into local Yue speech. Examples include ma5gei3jau4 (salted cod) from Portuguese bacalhau, and gaam1 (almond) from amêndoa, often adapted to fit Taishanese phonology while retaining semantic ties to trade and cuisine. These borrowings, more prevalent in Macau-influenced varieties, highlight the impact of European commerce on everyday vocabulary.[38] In diaspora communities, particularly among Taishanese speakers in North America and Southeast Asia, English loanwords have integrated extensively due to immigration and economic interactions since the late 19th century. Terms like ka1 for "car" and poh4si1 for "boss" exemplify this, with adaptations such as vowel lengthening (a in "car" to open syllable [ka:]) and epenthesis (-si in "boss" to [pɔ:si:]) to align with Taishanese syllable structure and tones. These nativizations preserve core meanings while conforming to the dialect's phonological constraints.[39] Post-1949 language policies in the People's Republic of China promoting Putonghua (Mandarin) as the standard have introduced administrative and technical terms into Taishanese, especially in official and educational contexts within Guangdong. Borrowings such as dan1wei2 (work unit) and zheng4fu3 (government) from Mandarin reflect this convergence, often adopted directly for bureaucratic precision while coexisting with native equivalents in informal speech. This influence underscores the dialect's adaptation to national standardization efforts.[40]Writing system and orthography
Traditional writing
Taishanese relies on Chinese characters (traditional in historical texts and diaspora communities, simplified in modern mainland China) drawn from the standard literary corpus to represent its spoken forms, with writers selecting homophones that approximate the dialect's phonology, particularly its distinctive initials like the voiceless alveolar lateral fricative [ɬ] and tonal patterns. This approach mirrors practices in other Yue dialects, where characters are chosen not solely for semantic meaning but for phonetic alignment, allowing the logographic system to encode dialectal speech without a dedicated script. For example, unique Taishanese terms may employ characters that sound similar in Cantonese due to shared phonological features, facilitating readability across related varieties.[41] Vernacular literature in Taishanese dates to the late 19th and early 20th centuries, manifesting in folk songs and regional documentation transcribed with Chinese characters to capture local idioms and oral traditions. A notable example is the 1929 collection Taishan Geyao Ji, edited by Chen Yuanzhu, which compiles Taishanese folk songs using dialect-specific character selections to reflect pronunciation and cultural content, such as rural life and emigration themes. Local gazetteers from Taishan, like those compiled in the early 20th century, integrate vernacular elements—such as glosses for regional terminology—within primarily classical Chinese frameworks to describe customs, geography, and migration patterns.[42] Writing Taishanese faces challenges from the absence of standardized characters for dialect-exclusive words, resulting in inconsistent representations and heavy borrowing from Cantonese orthographic conventions, where shared vocabulary allows approximate phonetic matching but obscures purely Taishanese innovations. This reliance often leads to hybrid forms that prioritize intelligibility over precision, limiting the development of a distinct literary tradition.[43] Historical texts from the Taishan region and diaspora communities further illustrate this character-based tradition, including migrant letters from the late 19th and early 20th centuries that blend classical phrasing with vernacular expressions in Chinese characters to convey personal and business matters. In the 1920s, local publications like the Nanshe Monthly in Taishan utilized mixed classical and vernacular scripts to engage both residents and overseas Taishanese, fostering community ties through accessible prose on migration and local affairs.[41][44]Romanization systems
Taishanese lacks a single standardized romanization system, with various schemes developed for linguistic documentation, military training, and community preservation. One influential early system appears in the Defense Language Institute's 1964 Chinese-Cantonese (Toishan) Basic Course, which employs a Latin alphabet-based transcription with diacritics to represent tones and distinctive sounds. This system uses conventions such as "ng" for the velar nasal initial /ŋ/ and final consonants like -p, -t, -k for checked syllables, facilitating pronunciation for non-native learners.[45] Tones in this scheme are marked with diacritics, such as grave () for low tones and other accents for falling or rising contours, distinguishing Taishanese's five contrastive lexical tones (high level, mid level, low level, mid falling, and low falling), with additional changed tones in some analyses, from the six tones of standard [Cantonese](/page/Cantonese). For example, the word for "yesterday" is romanized as *dohmáhn* to indicate its low-falling then rising tone pattern. This diacritic-heavy approach contrasts with Jyutping, the standard romanization for Cantonese developed by the Linguistic Society of Hong Kong, which uses superscript numbers (1-6) for tones without accents. Taishanese adaptations of Jyutping often require modifications, such as unique markers for the high-falling tone (not distinctly numbered in Jyutping) and additional symbols for dialect-specific features like the lateral fricative /ɬ/.[3]
Since the 1990s, romanization systems have played a key role in diaspora communities, particularly among Taishanese speakers in the United States, where the language was dominant among early Chinese immigrants. These systems support informal education in community centers and heritage language programs, helping second- and third-generation speakers maintain fluency amid language shift toward English and Mandarin. Online resources, including vocabulary archives and audio lessons, increasingly employ hybrid romanizations based on the DLI scheme or Jyutping adaptations to provide accessible learning materials for global users. Recent online dictionaries, such as Taishanese Language Home (as of 2023), employ hybrid romanizations combining Jyutping elements with IPA for audio-linked learning.[46][47]
