Recent from talks
Nothing was collected or created yet.
Kra languages
View on Wikipedia| Kra | |
|---|---|
| 仡央 Geyang | |
| Geographic distribution | Southern China, Northern Vietnam |
| Linguistic classification | Kra–Dai
|
| Proto-language | Proto-Kra |
| Language codes | |
| Glottolog | kada1291 |
The Kra languages (/krɑː/ KRAH; also known as the Geyang or Kadai languages) are a branch of the Kra–Dai language family spoken in southern China (Guizhou, Guangxi, Yunnan) and in northern Vietnam (Hà Giang Province).
Names
[edit]The name Kra comes from the word *kraC[1] "human" as reconstructed by Ostapirat (2000), which appears in various Kra languages as kra, ka, fa or ha. Benedict (1942) used the term Kadai for the Kra and Hlai languages grouped together and the term Kra-Dai is proposed by Ostapirat (2000).
The Kra branch was first identified as a unified group of languages by Liang (1990),[2] who called it the Geyang 仡央 languages. Geyang 仡央 is a portmanteau of the first syllable of Ge- in Gelao and the last syllable of -yang in Buyang. The name Kra was proposed by Ostapirat (2000) and is the term usually used by scholars outside China, whereas Geyang is the name currently used in China.
Significance
[edit]Several Kra languages have regionally unusual consonant clusters and sesquisyllabic or disyllabic words, whereas other Kra–Dai languages tend to have only single syllables. The disyllables in Buyang have been used by Sagart (2004)[3] to support the view that the Kra-Dai languages are a subgroup within the Austronesian family. Unlike the Tai and Kam–Sui languages, most Kra languages, including Gelao and Buyang, have preserved the proto-Kra–Dai numerical systems. The only other Kra–Dai branch that preserves this is Hlai.[4] Most other Kra–Dai languages adopted Chinese numerals over 1000 years ago.
As noted by Jerold A. Edmondson, the Kra languages contain words in metalworking, handicrafts and agriculture that are not attested in any other Kra–Dai language.[5] This suggests that the Kra peoples may have developed or borrowed many technological innovations independently of the Tai and Kam-Sui peoples.
Reconstruction
[edit]The Proto-Kra language has been reconstructed by Weera Ostapirat (2000).
Classification
[edit]Morphological similarities suggest the Kra languages are closest to the Kam–Sui branch of the family. There are about a dozen Kra languages, depending on how languages and dialects are defined. Gelao, with about 8,000 speakers in China out of an ethnic population of approximately 500,000, and consists of at least four mutually unintelligible language varieties, including Telue (White Gelao), Hagei (Blue or Green Gelao), Vandu (Red Gelao), A'ou (Red Gelao), and Qau (Chinese Gelao).
Ostapirat (2000)
[edit]The internal classification below is from Weera Ostapirat (2000), who splits the Kra branch into the Eastern and Western branches.
| Kra | |
According to Jerold Edmondson (2002), Laha is too conservative to be in Western Kra, considered it to constitute a branch of its own. However, Edmondson (2011)[6] later reversed his position, considering Laha to be more closely related to Paha.
Ethnologue mistakenly includes the Hlai language Cun of Hainan in Kra; this is not supported by either Ostapirat or Edmondson.
Hsiu (2014)
[edit]Hsiu's (2014)[7] classification of the Kra languages, based on computational phylogenetic analysis as well as Edmondson's (2011)[6] earlier analysis of Kra, is given below, as cited in Norquest (2021).[8]
Substrata
[edit]Andrew Hsiu (2013, 2017) reports that Hezhang Buyi, a divergent, moribund Northern Tai language spoken by 5 people in Dazhai 大寨, Fuchu Township 辅处乡, Hezhang County 赫章县, Guizhou, China, has a Kra substratum.[9]
Maza, a Lolo–Burmese language spoken in Mengmei 孟梅, Funing County, Yunnan, is also notable for having a Qabiao substratum (Hsiu 2014:68-69).[10]
According to Li Jinfang (1999),[11] the Yang Zhuang people of southwestern Guangxi may have been Kra speakers who had switched to Zhuang.
Demographics
[edit]The Kra languages have a total of about 22,000 speakers.[5] In Vietnam, officially recognized Kra peoples are the Cờ Lao, La Chí, La Ha and Pu Péo. In China, only the Gelao (Cờ Lao) have official status. The other Kra peoples are variously classified as Zhuang, Buyi, Yi, and Han.
"Hotspots" for Kra languages include: within China, most of western Guizhou, the prefecture-level city of Baise in western Guangxi, and Wenshan Zhuang and Miao Autonomous Prefecture in southeastern Yunnan; as well as northern Vietnam's Hà Giang Province. This distribution runs along a northeast-southwest geographic vector, forming what Jerold A. Edmondson calls a "language corridor."[5]
Multilingualism is common among Kra language speakers. For example, many Buyang can also speak Zhuang.[12]
- Western
- Lachi (拉基, La Chí) – 10,300 (7,863 in Vietnam in 1990; 2,500 in Maguan County, Yunnan, China in 1995)
- Gelao (仡佬, Cờ Lao) – 7,900 (spoken in Guizhou, Longlin Various Nationalities Autonomous County in Guangxi, and northern Vietnam)
- Laha (拉哈, La Ha) – 1,400 (officially recognized in Vietnam; most divergent western Kra language)
- Eastern
- Buyang 布央 dialect cluster – 2,000
- Paha 巴哈 (considered a separate language by Ostapirat; spoken in Yangliancun 央连村, Diyu Township, Guangnan County 广南县, Yunnan)
- Langjia 郎架 (spoken in Langjia, Funing County, Yunnan along the Guangxi border)
- Ecun 峨村 (spoken in Ecun, Funing County, Yunnan along the Guangxi border)
- Yalang 雅郎 (Yalhong; spoken in Rongtun 荣屯, Napo County, Guangxi)
- Qabiao (Pubiao 普标, Pu Péo) – 700
- En (Nùng Vên; spoken in northern Vietnam) – 250
- Buyang 布央 dialect cluster – 2,000
Numerals
[edit]| Language | One | Two | Three | Four | Five | Six | Seven | Eight | Nine | Ten |
|---|---|---|---|---|---|---|---|---|---|---|
| (Proto-Austronesian) | *isa | *duSa | *telu | *Sepat | *lima | *enem | *pitu | *walu | *Siwa | *sa-puluq |
| Proto-Kra | *tʂəm C | *sa A | *tu A | *pə A | *r-ma A | *x-nəm A | *t-ru A | *m-ru A | *s-ɣwa B | *pwlot D |
| Buyang, Baha | tɕam45 | θa322 | tu322 | pa322 | m̥a33 | nam31 | ðu33 | mu31 | dʱa33 | pʷat55 |
| Buyang, Ecun | pi53 | θa24 | tu24 | pa24 | ma44 | nam24 | tu44 | ma0 ðu44 | va55 | put55 |
| Buyang, Langjia | am35 | ɕa54 | tu54 | pa54 | ma312 | nam54 | ðu312 | ma0 ðu312 | va11 | put55 |
| Buyang, Yerong | ɔm55 | θau53 | taːi53 | po53 | mo43 | naːm53 | təu31 | ɬəu43 | vo55 | pɔt55 |
| En (Nung Ven) | ʔam332 | θa243 | tu243 | pa33 | ma243 | nəm243 | ʔam332 tu243 | me332 ru33 | wa54 | θət33 |
| Qabiao | tɕia33 | ɕe53 | tau53 | pe53 | ma33 | ma33 nam35 | ma33 tu53 | ma33 ʐɯ33 | ma33 ɕia31 | pət31 |
| Laha, Wet | tɕɐm31 | sa343 | tu343 | pɑ343 | mɑ33 | dɐm343 | tʰo343 | ma33 hu33 | so33 wa24 | pɤt23 |
| Laha, Dry | cạm6 | śa5 | tợw3 | pa3 | ha6 | hôk4 | cêt4 | pet4 | kạw6 | śêp4 |
| Lachi | tɕa33 | su11 | te11 | pu11 | m̩11 | ȵiã11 | te24 | ŋuɛ11 | liu24 | pɛ11 |
| Gelao, Bigong | sɿ55 təɯ33 | səɯ31 təɯ33 | tɔ31 | pɔ31 | mɔ31 | nai31 | tʰɔ31 | ʑɔ31 | ʑɔu31 | hui13 |
| Gelao, Moji | tsɿ53 | səu31 | ta31 | pu31 | mlau31 | tɕʰau31 | xei31 | xe31 | kəu31 | tsʰei53 |
| Gelao, Puding | se55 | so55 | tua55 | pu45 | mu53 | naŋ53 | ɕi33 | vra53 | su33 | paɯ33 |
| Gelao, Pudi | sɪ55 | səɯ42 | tji42 | pau42 | mau31 | mjaŋ31 | te42 | ɣe31 | sau13 | ɕye13 |
| Gelao, Red | tsə44 | se33 | tua44 | pu44 | maŋ44 | ɬoŋ44 | te44 | wu35 | ʂe35 | la51 kwe44 |
| Gelao, White[14] | tsɿ33 | sɯn35 | tau55 | pu55 | mlən35 | tɕʰau55 | hi55 | ɕiau55 | ku55 | tɕʰiu33 |
| Gelao, Sanchong | ʂɿ43 | ʂa45 | tau45 | pu45 | mei21 | ȵaŋ21 | tʂau45 | ʑau21 | ʂo43 | sɿ43 pie43 |
| Gelao, Wanzi | si33 | su33 | ta33 | pu33 | mpu44 | nan33 | ɕi24 | vla44 | səɯ24 | pe24 |
| Mulao[15] | tsɿ53 | ɬu24 | ta24 | pʰu24 | mu31 | ȵe31 | sau31 | ɣau31 | so24 | ve53 |
| Gelao, Heijiaoyan[16] | sɿ44 | sɑ44 | tuu44 | pu44 | - | - | - | - | - | - |
| Gelao, Jianshan[16] | ʐɤ42 | sw42 | tuɑ42 | pu44 | - | - | - | - | - | - |
| Gelao, Banliwan[16] | i53 | ɑ53 | ɑ53 muŋ53 | ɑŋ44 | - | - | - | - | - | - |
| Gelao, Zunyi[16] | 失 (shi) | 沙 (sha) | 刀 (dao) | 波 (bo) | 媒 (mei) | 娘召 (niangshao) | 召 (shao) | 饶 (rao) | 署 (shu) | 失不 (shibu) |
| Gelao, Renhuai[16] | 思 (shi) | 沙 (sha) | 刀 (dao) | 波 (bo) | 差 (cha) | 良 (liang) | - | 绕 (rao) | 素 (su) | 死比 (sibi) |
Notes
[edit]- ^ Note: The superscript C is a reconstructed tone, typically considered to be historically derived from a glottal stop (*-ʔ).
- ^ Liang Min 梁敏. 1990 "Geyang yuqun de xishu wenti 仡央语群的系属问题 / On the affiliation of the Ge-Yang group of languages." In Minzu Yuwen 民族语文 1990(6): 1-8.
- ^ Sagart, Laurent. 2004. The Higher Phylogeny of Austronesian and the Position of Tai-Kadai.
- ^ Norquest, Peter Kristian (2007). A Phonological Reconstruction of Proto-Hlai (PhD thesis). University of Arizona. hdl:10150/194203.
- ^ a b c Diller, Anthony, Jerold A. Edmondson, and Yongxian Luo ed. The Tai–Kadai Languages. Routledge Language Family Series. Psychology Press, 2008.
- ^ a b Edmondson, Jerold. 2011. Notes on the subdivisions in Kra Archived 2015-10-02 at the Wayback Machine. Published as Geyang yuyan fenlei buyi 仡央语言分类补议 in Journal of Guangxi University for Nationalities. 广西民族大学学报. 33.2.8-14.
- ^ Hsiu, Andrew. 2014. Kra-Dai notes.
- ^ Norquest, Peter (2021). "Classification of (Tai-)Kadai/Kra-Dai languages". The Languages and Linguistics of Mainland Southeast Asia. De Gruyter. pp. 225–246. doi:10.1515/9783110558142-013. ISBN 9783110558142. S2CID 238672319.
- ^ Hsiu, Andrew. 2013. “Shui” varieties of western Guizhou and Yunnan. Presented at the 46th International Conference on Sino-Tibetan Languages and Linguistics (ICSTLL 46), Dartmouth College, Hanover, New Hampshire, United States, August 7–10, 2013 (Session: Tai-Kadai Workshop).
- ^ Hsiu, Andrew. 2014. "Mondzish: a new subgroup of Lolo-Burmese Archived 2016-03-03 at the Wayback Machine". In Proceedings of the 14th International Symposium on Chinese Languages and Linguistics (IsCLL-14). Taipei: Academia Sinica.
- ^ Li Jinfang (1999). Studies in the Buyang Language. Beijing: Central University for Nationalities Press.
- ^ 李锦芳/Li, Jinfang and 周国炎/Guoyan Zhou. 仡央语言探索/Geyang yu yan tan suo. Beijing, China: 中央民族大学出版社/Zhong yang min zu da xue chu ban she, 1999.
- ^ "Archived copy". Archived from the original on 2014-04-21. Retrieved 2011-10-23.
{{cite web}}: CS1 maint: archived copy as title (link) - ^ Numbers 1-9 are suffixed with du35.
- ^ Note: Mulao here is a Red Gelao language variety spoken in Guizhou, and is not the same as Mulam, a Kam-Sui language of Guangxi.
- ^ a b c d e 遵义地区志:民族志 (1999)
Further reading
[edit]- Ostapirat, Weera (2000). "Proto-Kra". Linguistics of the Tibeto-Burman Area 23 (1): 1-251
- Edmondson, Jerold A. (2002). The Laha language and its position in Proto-Kra.
- Sagart, Laurent. (2004) The higher phylogeny of Austronesian and the position of Tai-Kadai. Oceanic Linguistics 43,2: 411-444.
External links
[edit]Kra languages
View on GrokipediaIntroduction
Names
The Kra languages derive their name from the reconstructed Proto-Kra form *kraC, an autonym meaning "human being," which appears in various descendant languages as forms such as *kra, *ka, *fa, or *ha.[9] This nomenclature was proposed by linguist Weera Ostapirat in his reconstruction of Proto-Kra, highlighting the group's internal self-designation for "person" or "people."[10] Within China, the languages are commonly termed the Geyang (仡央) branch, a designation coined by Chinese scholars Min and Zhang by combining "Ge" from Gelao and "Yang" from Buyang to represent major subgroups.[10] This name reflects official classifications in Chinese linguistic and ethnographic contexts, where it encompasses languages spoken primarily in Guizhou, Guangxi, and Yunnan provinces.[10] Historically, the group was included under the broader label "Kadai," an older term introduced by Paul K. Benedict in 1942 to denote the entire Kra-Dai family, though it has since been narrowed or replaced in favor of "Kra" for this specific branch.[10]Significance
The Kra languages, as an early-diverging branch of the Kra-Dai family, play a pivotal role in reconstructing the proto-phonology and historical development of the broader language group. Their divergent features, including distinct tonal systems and consonant inventories, provide critical evidence for linking the Kra branch to other subgroups like Kam-Sui and Tai, revealing shared innovations such as glottal constrictions in certain tone categories and vowel length contrasts before codas. This has enabled linguists to trace the family's internal diversification, with phylogenetic analyses estimating the Proto-Kra divergence around 2,435 years before present, highlighting an ancient split that informs the overall timeline of Kra-Dai expansion from southern China.[2][11] The systematic study of Kra languages culminated in Weera Ostapirat's seminal reconstruction of Proto-Kra, which not only solidified their position within Kra-Dai but also inspired the modern nomenclature "Kra-Dai," derived from the reconstructed autonyms of the Kra and Tai branches. This proposal marked a shift from earlier terms like "Tai-Kadai," emphasizing a more balanced representation of the family's structure and challenging prior views that marginalized Kra as mere outliers. By demonstrating regular sound correspondences across Kra varieties—such as the development of four tone categories into high/low reflexes—Ostapirat's work (2000) established a foundation for comparative studies, underscoring Kra's value in resolving debates on the family's genetic coherence.[12][3] Beyond linguistics, Kra languages hold sociolinguistic significance as the heritage of small ethnic minority groups in southern China and northern Vietnam, with total speakers numbering around 22,000 across seven languages, many of which are endangered due to assimilation pressures. Their preservation efforts contribute to documenting cultural diversity in the Mainland Southeast Asia sprachbund, where Kra-Dai languages, including Kra, act as vectors for areal features like tonality and classifiers, influencing neighboring families such as Austroasiatic. This understudied branch thus aids in broader understandings of language contact, migration, and identity in East and Southeast Asia.[2][11]Reconstruction
Proto-Kra phonology
The phonology of Proto-Kra, the reconstructed ancestor of the Kra branch of the Kra-Dai language family, was first systematically reconstructed by Weera Ostapirat in his 2000 monograph. This reconstruction draws on comparative data from six representative Kra languages and their dialects: Gelao (various varieties including A'ou and Aqaw), Lachi, Laha, Buyang, Paha, and Pubiao. Ostapirat's analysis identifies a syllable structure of the form (C₁)(C₂)V(C₃), where C₁ is a main initial, C₂ a medial or preinitial (often glottal or liquid), V a vocalic nucleus, and C₃ a final consonant or tone-bearing coda. The system reflects typical Kra-Dai areal features, such as sesquisyllabicity in some forms and the development of tones from earlier segmental contrasts, but with innovations like a robust retroflex series unique to the branch. Proto-Kra features a large consonant inventory of 32 phonemes, including series of voiceless aspirated and unaspirated stops, voiced stops, nasals, affricates, fricatives, laterals, rhotics, and glides. Notably, it includes a full set of seven simple retroflex initials (*ʈ, *ʈʰ, *ɖ, *ʈʂ, *ʈʂʰ, *ɖʐ, ɳ) and eleven complex retroflex clusters (e.g., *ʈ-l-, *ɖ-l-, *ʔɳ-), as well as retroflex rhotics (*hr-, r-). These retroflexes, which do not survive as distinct sounds in any modern Kra language, likely arose from earlier alveolar or palatal contacts and merged with alveolar or palatal series in daughter languages; for instance, Proto-Kra *mʈa^A 'eye' corresponds to alveolar reflexes like Gelao mta^1. Only seven consonants occur as finals: voiceless stops *-p, *-t, *-k; nasals *-m, *-n, *-ŋ; and possibly a glide or nasalized coda. Preinitial glottal stops (*ʔ-) and liquids (*l-, r-) frequently form clusters, contributing to sesquisyllabic onsets in words like *ʔɳəŋ^B 'salty'. Subsequent scholarship has questioned the retroflex series, proposing disyllabic origins for some forms (e.g., *ma.ta^A 'eye') to explain the lack of direct reflexes without invoking unattested mergers. The vowel system is modest, with six monophthongs forming a symmetrical trapezoidal pattern: high *i and *u, mid *e and *o, central *ə, and low *a. These occur in both open and closed syllables, with length potentially contrastive in some environments though not fully distinguished in the reconstruction. Four diphthongs are posited—*ai, *aɯ, *ui, *au—restricted to open syllables, as in *kau^A 'forest'. Vowel qualities show regular correspondences across Kra languages, such as *ə merging with *a in some daughter branches. Lexical tones number four, labeled A, B, C, and D in the conventional Kra-Dai system, arising from the split of earlier proto-final consonants and phonation types in Pre-Proto-Kra-Dai. Tone A is typically high rising or level, B low falling, C high with creaky or glottalized phonation (reflecting a proto-glottal stop), and D low level or falling, confined to closed syllables with stop or nasal codas. For example, *na^A 'thick' contrasts with *na^D in checked syllables. This tonal system, while shared with other Kra-Dai branches, shows branch-specific innovations in C-tone glottalization and the restriction of D to non-open syllables. Ostapirat's reconstruction ties these tones to higher-level Kra-Dai etyma, supporting the family's internal coherence.Proto-Kra vocabulary
The reconstruction of Proto-Kra vocabulary relies on the comparative method, utilizing data from six primary Kra languages: Lachi, three varieties of Gelao, Buyang, Laha, Paha, and Pubiao (Qabiao). Weera Ostapirat's 2000 monograph provides the foundational lexicon, comprising around 250 etyma drawn from basic vocabulary domains such as body parts, numerals, kinship, nature, and daily activities. These reconstructions emphasize monosyllabic roots with tonal distinctions (marked as A–D, corresponding to level, rising, falling, and checked tones) and petiolar prefixes (e.g., *C- for presyllables), reflecting the phonological system outlined in parallel studies.[13] Ostapirat's etyma demonstrate regular correspondences across daughter languages, enabling the identification of innovations and retentions. For example, body part terms often preserve initial clusters or liquids, as in *krai B 'head' (reflected as /xɯi/ in Lachi and /kʰlɛ/ in Gelao) and *m-ʈa A 'eye' (cognate with /mta/ in Buyang and /mtaː/ in Pubiao). Such forms highlight Proto-Kra's sesquisyllabic tendencies in some roots, though most are reduced to monosyllables in modern reflexes. Kinship vocabulary includes *mai C 'mother' (seen in /mɛ/ Lachi and /mɔj/ Gelao) and *pa B 'father' (/pʰa/ Buyang, /pa/ Pubiao), underscoring familial terms' conservatism. Natural phenomena etyma, like *ʔuŋ C 'water' (/ʔuŋ/ Lachi, /ʔɔŋ/ Gelao), reveal shared semantic fields with higher-level Kra-Dai reconstructions.[13] Numeral systems are among the most stable, providing crucial evidence for subgrouping and external affiliations. Ostapirat reconstructs a decimal base with forms showing initial variation and tonal contours:| Numeral | Proto-Kra Form | Example Reflexes |
|---|---|---|
| one | *tʂəm C | /tʃʰam/ (Lachi), /tsʰaŋ/ (Gelao) |
| two | *sa A | /saj/ (Buyang), /sa/ (Pubiao) |
| three | *tu A | /tʰu/ (Pubiao), /to/ (Gelao) |
| four | *pə A | /pə/ (Lachi), /fa/ (Buyang) |
| five | *r-ma A | /ŋma/ (Gelao), /ma/ (Laha) |
| six | *x-nəm A | /snam/ (Lachi), /nɛm/ (Pubiao) |
| seven | *t-ru A | /tʰɯ/ (Buyang), /sru/ (Gelao) |
| eight | *m-ru A | /mɯ/ (Paha), /pʰru/ (Lachi) |
| nine | *s-ɣwa B | /sŋwa/ (Gelao), /kwa/ (Pubiao) |
| ten | *pwlot D | /pʷlɔt/ (Buyang), /plɔt/ (Lachi) |
Classification
Ostapirat (2000)
In 2000, Weera Ostapirat published a seminal reconstruction of Proto-Kra in his dissertation, establishing the Kra languages as a primary branch of the Kra-Dai family distinct from Tai, Kam–Sui, and Hlai.[15] Drawing on comparative data from phonological correspondences and shared vocabulary, Ostapirat identified Kra as a coherent genetic unit supported by approximately 40 lexical innovations unique to the group, such as reflexes of proto-forms not found in other Kra-Dai branches.[15] This work shifted the understanding of Kra from Benedict's earlier "Kadai" outliers to a well-defined subgroup, emphasizing innovations like complex initial consonant clusters and tonal developments.[6] Ostapirat proposed a classification into four main subgroups based on systematic sound changes and lexical retentions, treating Gelao, Lachi, and Laha as having internal dialectal divisions while others remain more uniform.[15] The Western Kra subgroup includes Gelao (with northern, southern, and southwestern varieties) and Lachi (with northern, southern, and southwestern varieties), sharing innovations such as merged initial stops and specific vowel shifts.[6] Southern Kra is represented by Laha (northern and southern dialects), characterized by retained aspirated stops and distinct tonal contours.[15] Central Kra consists solely of Paha, a conservative language preserving proto-initial fricatives.[15] Eastern Kra encompasses Buyang (northern and southern dialects) and Lakkia, unified by shared retroflex initials and lexical items like *kraw for 'person'.[15] Additionally, Ostapirat incorporated Laqua (also known as Pubiao or Qabiao) as a monotypic branch, linking it closely to Eastern Kra through phonological parallels, such as simplified syllable codas.[6] This structure highlights Kra's internal diversity while demonstrating its unity via proto-forms like *ʔŋaːᴬ 'I' and *mruːᴮ 'dog', reconstructed across the subgroups.[15] Ostapirat's classification excluded languages like Sui and Kam, reassigning them to Kam–Sui, thereby refining the family's internal phylogeny and influencing subsequent research.[6]| Subgroup | Languages and Varieties | Key Innovations |
|---|---|---|
| Western Kra | Gelao (northern, southern, southwestern); Lachi (northern, southern, southwestern) | Merged voiceless stops; vowel harmony patterns |
| Southern Kra | Laha (northern, southern) | Retained aspiration; mid-tone developments |
| Central Kra | Paha | Preserved fricative initials |
| Eastern Kra | Buyang (northern, southern); Lakkia | Retroflex series; shared ethnonyms |
| (Monotypic) | Laqua/Pubiao | Simplified codas; lexical ties to Eastern |
Hsiu (2014) and later updates
Andrew Hsiu advanced the classification of the Kra languages through extensive fieldwork and phylogenetic methods, building on prior work by Edmondson (2011). His 2014 analysis incorporated computational phylogenetics to refine subgroupings, emphasizing the internal diversity of key languages like Gelao and the position of Biao. Hsiu proposed that Biao, spoken in northwestern Guangdong, consists of three mutually unintelligible varieties (Shidong, Yonggu, and Dagang) that share phonological and lexical features with Lakkja, potentially forming a distinct subgroup within Kra-Dai or an independent primary branch coordinate with Kra.[16] This placement highlights Biao's peripheral status relative to core Kra languages, with shared innovations in initial consonants and vocabulary suggesting early divergence. Central to Hsiu's 2014 framework is a detailed subdivision of the Gelao languages, the most diverse Kra subgroup, based on comparative wordlists and dialect surveys. He positioned Lachi as a close sister clade to Gelao within Northern Kra, diverging early but retaining shared Proto-Kra retentions like lateral codas. Gelao itself divides into five main color-based subgroups, each encompassing multiple endangered varieties: Red Gelao (e.g., Vandu, A'ou, Bigong, Hongfeng, Houzitian), White Gelao (e.g., Judu, Moji, Wantao, Yueliangwan, Laozhai), Central Gelao (Qau and Hakei clusters), Black Gelao (Ayo, Aqao, Mulao), and Green Gelao (Dongkou, Xinzhai, Wanzi, Dagouchang). These subgroups exhibit mutual unintelligibility and varying degrees of endangerment, with Red Gelao varieties particularly vulnerable, some spoken by fewer than 50 individuals.[17][18] Hsiu's broader Kra classification aligns with and extends Edmondson's (2011) model, dividing the branch into Northern Kra (Gelao–Lachi) and Southern Kra (Laha, Buyang complex including Paha and Ecun, and Qabiao/Pubiao). Northern Kra languages preserve archaic features like complex consonant clusters, while Southern Kra shows innovations in tone and vowel systems. This structure underscores Kra's basal position in Kra-Dai, with evidence of substratal influences from Hmong-Mien and Austroasiatic.[19] Subsequent updates to Hsiu's framework include his 2017 documentation of mixed languages like Hezhang Buyi, which reveal Kra substrata in Northern Tai varieties, supporting deeper Kra-Tai interactions.[20] More recently, a 2023 Bayesian phylogenetic study using 100 Kra-Dai languages confirmed Kra's monophyly as one of five primary branches (alongside Hlai, Ong-Be, Tai, and Kam-Sui), with divergence estimated around 4,000–5,000 years ago in southern China, linked to environmental and migratory shifts. This analysis reinforces Hsiu's subgroupings through high posterior probabilities for internal nodes, while suggesting ongoing refinement via expanded lexical datasets.[11] Hsiu's MSEA Languages project continues to provide tentative updates, incorporating new field data on varieties like Red Gelao dialects.[17]Substrata
The Kra languages, spoken primarily in southern China, show evidence of substrate influences from adjacent language families due to historical contact in multilingual regions of Yunnan, Guangxi, and Guizhou provinces. These influences are most prominently attested through lexical borrowings and structural features borrowed from Northern Austroasiatic and Tibeto-Burman languages, reflecting the complex ethnolinguistic landscape of the area where Kra speakers interacted with pre-existing populations.[17] Northern Austroasiatic substrates are evident in basic vocabulary items across several Kra languages, such as words for 'water' and 'meat', which align with forms from branches like Khasi–Palaungic.[17] Qabiao and Buyang (excluding the Paha dialect) exhibit particularly heavy Austroasiatic borrowing, likely from local Northern Austroasiatic varieties, including terms related to daily life and environment that integrated early into the lexicon.[21] This suggests that Kra expansion involved assimilation of Austroasiatic-speaking groups, contributing to phonological and lexical layering in these languages.[10] Tibeto-Burman influences are similarly widespread, with loanwords for body parts and natural phenomena, including 'flower', 'hair', and 'mouth', appearing in core Kra varieties like Buyang and Gelao.[17] Structural parallels include pre-verbal negators, such as *ma- in Pudi and Judu Gelao or *pi- in Paha Buyang, which mirror Tibeto-Burman patterns (e.g., *ma- in Proto-Tibeto-Burman) and are rare elsewhere in Kra-Dai, indicating early contact-mediated adoption.[20] These features likely stem from interactions with Lolo-Burmese or Qiangic groups in northwestern Guangxi and Yunnan.[17] Limited Hmong-Mien substrate effects are noted in peripheral Kra languages like Biao, with borrowings for internal body parts such as 'liver', pointing to localized contact in mixed communities.[17] Overall, these substrata highlight the Kra languages' role as a northern outlier in Kra-Dai, shaped by prolonged areal diffusion rather than isolation.[22]Demographics
Speaker populations
The Kra languages, a small branch of the Kra-Dai family, are spoken by a relatively modest number of people, with estimates for the total speaker population ranging from approximately 10,000 to 22,000 individuals across China and Vietnam.[2][23] These languages are primarily associated with ethnic minority groups facing significant pressures from dominant languages like Chinese and Vietnamese, leading to high degrees of endangerment. Many Kra varieties are spoken only by older generations, with intergenerational transmission declining rapidly due to urbanization, education policies, and economic migration.[10] Speaker numbers vary widely by language, reflecting fragmented ethnic classifications and limited documentation. For instance, the Gelao languages (encompassing several dialects like A'ou, Cao Lan, and Qalao) are spoken by fewer than 6,000 people, primarily in Guizhou Province, China, where they constitute just 1.2% of the ethnic Gelao population of around 500,000.[18][24] Recent assessments confirm this low figure, emphasizing the languages' critically endangered status.[7] The following table summarizes approximate speaker populations for major Kra languages, based on key linguistic surveys (figures are estimates and may include ethnic populations where direct speaker counts are unavailable; data from the early 2000s onward show stability or slight decline):| Language | Approximate Speakers | Primary Locations | Notes/Source |
|---|---|---|---|
| Gelao (various dialects) | 5,000–6,000 | Guizhou, China | Critically endangered; ethnic population much larger.[18][7] |
| Buyang (including Paha) | ~2,000 | Yunnan/Guangxi, China; northern Vietnam | Small ethnic group; spoken in border villages.[2] |
| Lachi | ~2,000 | Yunnan, China; Hà Giang/Lào Cai, Vietnam | Ethnic La Chí population ~10,000, but speakers limited to adults.[2] |
| Laha | ~1,400 | Lào Cai/Sơn La, Vietnam | Ethnic population ~5,700; used by older adults only.[23][25] |
| Qabiao (Pubiao) | 700–1,300 | Yunnan, China; Hà Giang, Vietnam | Increasing slightly from 1989 census; endangered.[2] |
| En (Nùng Vên) | ~250 | Cao Bằng, Vietnam | Near-extinct; minimal documentation.[23] |
| Mulao | 0 (extinct) | Guangxi, China | Last fluent speakers deceased; ethnic classification persists.[26] |
Geographic distribution
The Kra languages, a branch of the Kra-Dai family, are primarily distributed across southern China and northern Vietnam, with speakers concentrated in remote, mountainous regions that reflect their historical dispersal from ancestral homelands in the Yangtze River basin during the late Holocene. Phylogeographic evidence indicates an early divergence and southward migration of Kra-Dai speakers, including Kra, originating from the Guangxi-Guangdong coastal area of South China toward Mainland Southeast Asia around 4,000–3,000 years ago, driven by agricultural expansions and environmental changes.[11] This distribution underscores the Kra languages' role as a northwestern periphery of the Kra-Dai family, with small, scattered communities often living alongside other ethnic groups like the Zhuang and Hmong-Mien.[10] In China, Kra languages are spoken mainly in the provinces of Guizhou, Guangxi, and Yunnan, where they form pockets in karst highlands and river valleys. Guizhou hosts the largest concentrations, particularly of Gelao varieties in counties like Longli, Duyun, and Rongjiang, with historical records tracing Gelao presence to the Tang Dynasty (7th–10th centuries CE). Guangxi features Buyang and related dialects in western areas such as Longlin and Napo counties, while Yunnan has Lachi in Jinchang, Paha in Yangliu, and Buyang in Xishuangbanna, often in border villages near Vietnam. These locations highlight the Kra's autochthonous status in pre-Han indigenous territories, with populations estimated at under 100,000 speakers total across China, many shifting to Mandarin or local dominant languages.[9][6] In northern Vietnam, Kra languages extend into the border provinces of Hà Giang, Lào Cai, and Sơn La, comprising a smaller but diverse set of communities amid ethnic minorities like the Hmong and Dao. Gelao is spoken in Hà Giang's Yên Minh district (e.g., Bản Ma Ché village), Lachi in nearby Đồng Văn and Quản Bạ districts (e.g., Bản Phùng), and Laha (or Pubiao variants) in Lào Cai's Bắc Hà and Sơn La's Mường La, with some Buyang influence in Cao Bằng. This transborder distribution, totaling fewer than 10,000 speakers, stems from migrations during the Qin-Han eras (221 BCE–220 CE), when Kra groups were displaced southward by Han expansions, preserving linguistic diversity in isolated highland enclaves despite pressures from Vietnamese and Chinese assimilation.[9][6][10]Linguistic features
Phonological characteristics
The Kra languages are characterized by a rich tonal system inherited from Proto-Kra, which featured a four-way tonal contrast labeled as tones A, B, C, and D. Tone A is associated with sonorant or open syllable endings and voiced onsets in the proto-language; tone B is linked to lax voicing features; tone C involves tense phonation; and tone D is restricted to checked syllables ending in stops.[27][9] This system has undergone mergers and splits in daughter languages, resulting in 4 to 9 tones in modern varieties, with some languages like certain Gelao dialects showing tonal mergers due to contact influences.[27][9][1] Consonant inventories in Kra languages are complex, featuring voiceless, voiced, and aspirated stops, as well as affricates and fricatives, with Proto-Kra reconstructing 32 consonants across labial, alveolar, postalveolar, retroflex, palatal, velar, and glottal places of articulation. Initial clusters are common, including prenasalized stops (e.g., *mb-, *nd-) and lateral clusters (e.g., *kl-, *kr-), which reflect an earlier stage of syllable complexity before reduction in some branches. A distinctive feature is the presence of breathy-voiced stops in languages like Lachi and Buyang, derived from Proto-Kra voiced stops, and a proposed retroflex series (*ʈ, *ɖ, *tʂ, dʐ, etc.) in the proto-reconstruction, though this has been debated as potentially arising from disyllabic forms or lenition rather than a dedicated series, given the lack of direct preservation in modern Kra languages. Final consonants are limited to eight in Proto-Kra: nasals (-m, *-n, -ŋ), liquids (-l, -r), and stops (-p, *-t, *-k), with *-l often developing into tones or glottal stops in contemporary varieties.[27][9][4] The vowel system of Proto-Kra includes six monophthongs—high (*i, *u), mid (*e, *o, ə), and low (a)—with length distinctions playing a role in tonal conditioning, particularly in open syllables. Diphthongs are restricted to four open-syllable rimes (-ai, *-aɯ, *-ui, *-au), which often merge or shift in daughter languages; for instance, *-aɯ may become a central vowel or trigger backing in Gelao. Vowel harmony or fronting/backing patterns appear in some modern Kra languages, influenced by areal contact with Sino-Tibetan groups, but these are not systematic in the proto-level reconstruction.[27] Syllable structure in Kra languages follows a (C)(C)V(C) template, with sesquisyllabic or disyllabic forms emerging from historical compounding or borrowing, though monosyllabicity dominates due to tone-bearing requirements. Unlike other Kra-Dai branches, Kra languages retain more conservative final consonants and clusters, contributing to their phonological diversity, but they lack widespread tone sandhi, distinguishing them from neighboring tonal families like Hmong-Mien. These features underscore the Kra branch's early divergence within Kra-Dai, with phonological innovations often linked to substratal influences from pre-Austroasiatic or Sino-Tibetan substrates in southern China.[27][28]Numeral systems
The numeral systems in Kra languages are characterized by their retention of the ancestral Proto-Kra-Dai vocabulary, a feature not shared with the Tai and Kam-Sui branches, where native forms have been extensively replaced by borrowings from Chinese or other Sino-Tibetan languages. This preservation allows for reliable reconstruction of early Kra-Dai numerals, primarily drawing from Kra and Hlai evidence, and highlights potential historical connections to Austronesian numeral forms, as initially noted in comparative studies. In Kra languages, numerals typically function with classifiers for counting nouns, following the analytic structure common to the family, and higher numbers beyond ten are often formed by multiplication or addition, such as combining units with terms for ten or hundred. The Proto-Kra numeral system, reconstructed by Ostapirat (2000), provides a foundational inventory for the branch, reflecting a quinary or decimal base with distinct roots for 1–10. These forms are attested across daughter languages like Gelao, Lachi, Buyang, and Qabiao, with variations due to phonological shifts, tone changes, and occasional prefix loss (e.g., the *r- in five). For instance, the form for "five" (*r-ma^A) appears as mpu in some Gelao varieties and ma in Buyang, while "six" (*x-nəm^A) is realized as nəm or naŋ in Gelao and Qabiao. This system underscores the conservative nature of Kra phonology and lexicon compared to more innovative branches.| Numeral | Proto-Kra Reconstruction | Tone Category | Example Reflex (Language) |
|---|---|---|---|
| one | *tʂəm | C | tʃəm (Proto-Western Kra, e.g., Lachi) |
| two | *sa | A | su (Gelao) |
| three | *tu | A | ta (Gelao) |
| four | *pə | A | pu (Gelao) |
| five | *r-ma | A | ma (Buyang) |
| six | *x-nəm | A | nəm (Qabiao) |
| seven | *t-ru | A | ʈu (Proto-Southern Kra, e.g., Laha) |
| eight | *m-ru | A | mu (Buyang) |
| nine | *s-ɣwa | B | swa (Gelao) |
| ten | *pwlot | D | blɔt (Buyang) |
| hundred | *kjən | A | kən (Proto-Eastern Kra, e.g., Qabiao) |
