Recent from talks
Contribute something
Nothing was collected or created yet.
Cham script
View on Wikipedia| Cham script Akhara Cam ꨀꨇꩉ ꨌꩌ | |
|---|---|
| Script type | |
Period | c. 350 CE – present[1] |
| Direction | Left-to-right |
| Languages | Cham, Sanskrit |
| Related scripts | |
Parent systems | |
Sister systems | Khmer, Kawi, Old Mon, Grantha, Tamil |
| ISO 15924 | |
| ISO 15924 | Cham (358), Cham |
| Unicode | |
Unicode alias | Cham |
| U+AA00–U+AA5F | |
| Brahmic scripts |
|---|
| The Brahmi script and its descendants |
The Cham script (Cham: ꨀꨇꩉ ꨌꩌ) is a Brahmic abugida used to write Cham, an Austronesian language spoken by some 245,000 Chams in Vietnam and Cambodia.[3] It is written horizontally left to right, just like other Brahmic abugidas.
History
[edit]The Cham script is a descendant of the Brahmi script of India.[3] Cham was one of the first scripts to develop from the Pallava script, this happened in the mid 350s CE. It came to Southeast Asia as part of the expansion of Hinduism and Buddhism. Hindu stone temples of the Champa civilization contain both Sanskrit and Chamic language stone inscriptions.[4] The earliest inscriptions in Vietnam are found in Mỹ Sơn, a temple complex dating from c. 300 CE to c. 1200 CE.[5][6] The oldest inscription is written in faulty Sanskrit. After this, inscriptions alternate between Sanskrit and the Cham language of the times.[7]
Cham kings studied classical Indian texts, such as the Dharmaśāstra, and inscriptions make reference to Sanskrit literature. Eventually, while the Cham and Sanskrit languages influenced one another, Cham culture assimilated Hinduism, and Chams were eventually able to adequately express the Hindu religion in their own language.[7] By the 8th century, the Cham script had outgrown Sanskrit and the Cham language was in full use.[8] Most preserved manuscripts focus on religious rituals, epic battles and poems, and myths.[7]
Modern Chamic languages have the Southeast Asian areal features of monosyllabicity, tonality, and glottalized consonants. However, they had reached the Southeast Asia mainland disyllabic and non-tonal. The script needed to be altered to meet these changes.[4]
Variety
[edit]The Cham now live in two groups: the Western Cham of Cambodia and the Eastern Cham (Panduranga/Phan Rang Cham) of Vietnam. For the first millennium AD, the Chamic languages were a dialect chain along the Vietnam coast. The breakup of this chain into distinct languages occurred once the Vietnamese pushed south, causing most Cham to move back into the highlands while some like Phan Rang Cham became a part of the lowland society ruled by the Vietnamese. The division of Cham into Western and Phan Rang Cham immediately followed the Vietnamese overthrow of the last Cham polity.[4] The Western Cham people are mostly Muslim[9] and therefore prefer the Arabic script. The Eastern Cham are mostly Hindu and continued to use the Indic script. During French colonial times, both groups had to use the Latin alphabet.[citation needed]
There are two varieties of the Cham script: Akhar Thrah (Eastern Cham) and Akhar Srak (Western Cham). The two are distinct enough to be encoded in separate blocks, the Eastern Cham block included in Unicode Standard version 5.1 since March 2008, the Western Cham block approved but still awaiting inclusion as of late 2023.[10][11][12] A standard ALA-LC romanization of both varieties, which is based on EFEO romanization of Cham, is available.[13][14]
Usage
[edit]The script is highly valued in Cham culture, but this does not mean that many people are learning it.[15] There have been efforts to simplify the spelling and to promote learning the script, but these have met with limited success.[16][17] Traditionally, boys learned the script around the age of twelve when they were old and strong enough to tend to the water buffalo. However, women and girls did not typically learn to read.[8] The traditional Indic Cham script is still known and used by Vietnam's Eastern Cham but no longer by the Western Cham.[18]
Structure
[edit]Similar to other abugidas, the consonants of Cham have the inherent vowel. Dependent vowel diacritics are used to modify the inherent vowel.[3] Since Cham does not have virāma, special characters should be used for pure consonants. This practice is similar to the chillu consonants of the Malayalam script.

Most consonant letters, such as [b], [t], or [p], include an inherent vowel [a] which does not need to be written. The nasal stops, [m], [n], [ɲ], and [ŋ] (the latter two transliterated ny and ng in the Latin alphabet) are exceptions, and have an inherent vowel [ɨ] (transliterated â). A diacritic called kai, which does not occur with the other consonants, is added below a nasal consonant to write the [a] vowel.[8]
Cham words contain vowel and consonant-vowel (V and CV) syllables, apart from the last, which may also be CVC. There are a few characters for final consonants in the Cham script; other consonants merely extend a longer tail on the right side to indicate the absence of a final vowel.[8]
Consonants
[edit]ꨆ ka
|
ꨇ kha
|
ꨈ ga
|
ꨉ gha
|
ꨊ ngâ
|
ꨋ nga
|
ꨌ ca
|
ꨍ cha
|
ꨎ ja
|
ꨏ jha
|
ꨐ nyâ
|
ꨑ nya
|
ꨒ nja
|
ꨓ ta
|
ꨔ tha
|
ꨕ da
|
ꨖ dha
|
ꨗ nâ
|
ꨘ na
|
ꨙ nda
|
ꨚ pa
|
ꨛ pa
|
ꨜ pha
|
ꨝ ba
|
ꨞ bha
|
ꨟ mâ
|
ꨠ ma
|
ꨡ mba
|
ꨢ ya
|
ꨣ ra
|
ꨤ la
|
ꨥ wa
|
ꨦ ṣa
|
ꨧ sa
|
ꨨ ha
|
Medial consonants
[edit]| Diacritic | ◌ꨳ -ia
|
◌ꨴ -ra
|
◌ꨵ -la
|
◌ꨶ -ua
|
|---|---|---|---|---|
| Example | ꨆꨳ kia
|
ꨆꨴ kra
|
ꨆꨵ kla
|
ꨆꨶ kua
|
Final consonants
[edit]Cham does not employ a virama to suppress vowels. Final consonants are indicated in one of three ways: an explicit final consonant letter, a combining diacritic mark, or by ꨥ.
ꩀ -k
|
ꩂ -ng
|
ꩄ -c
|
ꩅ -t
|
ꩆ -n
|
ꩇ -p
|
ꩈ -y
|
ꩉ -r
|
ꩊ -l
|
ꨥ -w
|
ꩋ -ṣ
|
Independent vowels
[edit]Six of the initial vowels are represented with unique letters:[19]
ꨀ a
|
ꨁ i
|
ꨂ u
|
ꨃ é
|
ꨄ ai
|
ꨅ o
|
Dependent vowels
[edit]Other initial vowels are represented by adding a diacritic to the letter ꨀ (a).[19] The same diacritics are used with consonants to change their inherent vowel:
| -ā | -i | -ī | -ei | -u | -ū | -e | -ē | ||
|---|---|---|---|---|---|---|---|---|---|
| diacritics | ◌ꨩ | ◌ꨪ | ◌ꨫ | ◌ꨬ | ◌ꨭ | ◌ꨭꨩ | ◌ꨮ | ◌ꨮꨩ | |
| shown with ꨆ (ka) | ꨆꨩ | ꨆꨪ | ꨆꨫ | ꨆꨬ | ꨆꨭ | ꨆꨭꨩ | ꨆꨮ | ꨆꨮꨩ | |
| -é | -é | -o | -ō | -ai | -ao | -â | -â | -au | |
| diacritics | ꨯꨮ | ꨯꨮꨩ | ꨯ | ꨯꨩ | ꨰ | ꨯꨱ | ◌ꨲ | ◌ꨲꨩ | ◌ꨮꨭ |
| shown with ꨆ (ka) | ꨆꨯꨮ | ꨆꨯꨮꨩ | ꨆꨯ | ꨆꨯꨩ | ꨆꨰ | ꨆꨯꨱ | ꨆꨲ | ꨆꨲꨩ | ꨆꨮꨭ |
Numerals
[edit]Cham has a distinctive set of digits:[19]
| Cham numerals | 0 ꩐
|
1 ꩑
|
2 ꩒
|
3 ꩓
|
4 ꩔
|
5 ꩕
|
6 ꩖
|
7 ꩗
|
8 ꩘
|
9 ꩙
|
|---|---|---|---|---|---|---|---|---|---|---|
| Names | thaoh ꨔꨯꨱꩍ |
sa ꨧ |
dua ꨕꨶ |
klau ꨆꨵꨮꨭ |
pak ꨚꩀ |
limâ ꨤꨪꨟ |
nam ꨗꩌ |
tajuh ꨓꨎꨭꩍ |
dalapan ꨕꨤꨚꩆ |
salapan ꨧꨤꨚꩆ |
Other symbols
[edit]
| Symbol | Name | Function |
|---|---|---|
| ꩜ | Spiral | Mark the beginning of a section. |
| ꩝ | Danda | Text break |
| ꩞ | Double Danda | Text break with progressive values of finality |
| ꩟ | Triple Danda | Text break with progressive values of finality |
Unicode
[edit]Cham script was added to the Unicode Standard in April, 2008 with the release of version 5.1.[19]
The Unicode block for Cham is U+AA00–U+AA5F:
| Cham[1][2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
| U+AA0x | ꨀ | ꨁ | ꨂ | ꨃ | ꨄ | ꨅ | ꨆ | ꨇ | ꨈ | ꨉ | ꨊ | ꨋ | ꨌ | ꨍ | ꨎ | ꨏ |
| U+AA1x | ꨐ | ꨑ | ꨒ | ꨓ | ꨔ | ꨕ | ꨖ | ꨗ | ꨘ | ꨙ | ꨚ | ꨛ | ꨜ | ꨝ | ꨞ | ꨟ |
| U+AA2x | ꨠ | ꨡ | ꨢ | ꨣ | ꨤ | ꨥ | ꨦ | ꨧ | ꨨ | ꨩ | ꨪ | ꨫ | ꨬ | ꨭ | ꨮ | ꨯ |
| U+AA3x | ꨰ | ꨱ | ꨲ | ꨳ | ꨴ | ꨵ | ꨶ | |||||||||
| U+AA4x | ꩀ | ꩁ | ꩂ | ꩃ | ꩄ | ꩅ | ꩆ | ꩇ | ꩈ | ꩉ | ꩊ | ꩋ | ꩌ | ꩍ | ||
| U+AA5x | ꩐ | ꩑ | ꩒ | ꩓ | ꩔ | ꩕ | ꩖ | ꩗ | ꩘ | ꩙ | ꩜ | ꩝ | ꩞ | ꩟ | ||
| Notes | ||||||||||||||||
Sample text
[edit]Below is a sample text in Cham, in Rumi, Jawi, and Cham scripts. This text is the translation of a famous Vietnamese short poetry.
| English Translation |
Loss of money; Sad for a few days |
|---|---|
| Cham Rumi Script |
Lahik jiên; drut druy hadôm harei |
| Cham Jawi Script |
لحيء جييٛن؛ دروت دروي حدوٛم حغاٛي |
| Cham Script |
ꨤꨨꨪꩀ ꨎꨳꨯꨮꩆ ꨕꨴꨭꩅ ꨕꨴꨭꩈ ꨨꨕꨯꩌ ꨨꨣꨬ |
| Vietnamese Original |
Mất tiền; Buồn vài ngày |
Notes
[edit]- ^ ꨀꨯꨱꩌ (U+AA00 & U+AA2F & U+AA31 & U+AA4C)
References
[edit]- ^ Marrison 1975, pp. 52–59.
- ^ Handbook of Literacy in Akshara Orthography, R. Malatesha Joshi, Catherine McBride (2019), p.29
- ^ a b c Cham. In The Unicode Standard, Version 11.0 (p. 661). Mountain View, CA: Unicode Consortium.
- ^ a b c Thurgood, Graham. From Ancient Cham to Modern Dialects: Two Thousand Years of Language Contact and Change. Honolulu: University of Hawaii Press, 1999.
- ^ Centre, UNESCO World Heritage. "My Son Sanctuary". UNESCO World Heritage Centre. Retrieved 2024-09-11.
- ^ Graham, Thurgood (11 September 2024). A Preliminary Sketch of Phan Rang Cham The Austronesian Languages of Asia and Madagascar (PDF). California State University, Chico. pp. Page=1.
- ^ a b c Claude, Jacques. "The Use of Sanskrit in the Khmer and Cham Inscriptions." In Sanskrit Outside India (Vol. 7, pp. 5-12). Leiden: Panels of the VIIth World Sanskrit Conference. 1991.
- ^ a b c d Blood, Doris E. "The Script as a Cohesive Factor in Cham Society". In Notes from Indochina on ethnic minority cultures. Ed. Marilyn Gregerson. 1980 p35-44.
- ^ Trankell & Ovesen 2004
- ^ "Proposal for encoding the Cham script in the BMP of the UCS"[1].
- ^ Hosken, Martin. (2019). L2/19-217 Proposal to encode Western Cham in the UCS.
- ^ Everson, M. & Cunningham, A. (2016). L2/16-198 Proposal to encode Western Cham in the SMP of the UCS.
- ^ ALA-LC Cham romanization
- ^ "Cham Alphabet". kauthara.org. 2015. Retrieved 2020-07-20.
- ^ Bruckmayr, Philipp. (2019). The Changing Fates of the Cambodian Islamic Manuscript Tradition. Journal of Islamic Manuscripts, 10(1), 1-23. doi:10.1163/1878464X-01001001
- ^ Blood (1980a, 1980b, 2008); Brunelle (2008).
- ^ Rachna, Thim (26 November 2019). "Cambodia's Cham Muslims Fear Loss of Ancient Script and Language". VOA Khmer. Retrieved 2022-05-18.
- ^ Akbar Husain, Wim Swann Horizons of Spiritual Psychology 2009 - Page 28 "The traditional Cham script, based on an Indian script, is still known and used by the Eastern Cham in Vietnam, but it has been lost by the Western Cham. The Cham language is also non-tonal. Words may contain one, two, or three syllables."
- ^ a b c d Everson, Michael (2006-08-06). "Proposal for encoding the Cham script in the BMP of the UCS" (PDF).
Bibliography
[edit]- Marrison, Geoffrey Edward (1975), "The Early Cham language and its relation to Malay", Journal of the Malaysian Branch of the Royal Asiatic Society, 48 (2 (228)): 52–59, JSTOR 41492110
- Etienne Aymonier, Antoine Cabaton (1906). Dictionnaire čam-français. Vol. 7 of Publications de l'École française d'Extrême-Orient. E. Leroux. Retrieved 2011-05-15.
- Blood, Doris (1980a). Cham literacy: the struggle between old and new (a case study). Notes on Literacy 12, 6–9.
- Blood, Doris (1980b). The script as a cohesive factor in Cham society. In Notes from Indochina, Marilyn Gregersen and Dorothy Thomas (eds.), 35–44. Dallas: International Museum of Cultures.
- Blood, Doris E. 2008. The ascendancy of the Cham script: how a literacy workshop became the catalyst. International Journal of the Sociology of Language 192:45-56.
- Brunelle, Marc. 2008. Diglossia, Bilingualism, and the Revitalization of Written Eastern Cham. Language Documentation & Conservation 2.1: 28–46. (Web based journal)
- Moussay, Gerard (1971). Dictionnaire Cam-Vietnamien-Français. Phan Rang: Centre Culturel Cam.
- Trankell, Ing-Britt and Jan Ovesen (2004). Muslim minorities in Cambodia. NIASnytt 4, 22–24. (Also on Web)
- R. Malatesha Joshi, Catherine McBride(2019). Handbook of Literacy in Akshara Orthography
External links
[edit]Cham script
View on GrokipediaHistorical Development
Origins and Early Inscriptions
The Cham script derives from the ancient Brahmi script of India, evolving through the Southern Brahmi tradition and specifically the Pallava-Grantha subfamily, which was adapted in Southeast Asia during the early centuries CE.[2] This development reflects the broader dissemination of Indic writing systems via trade and cultural exchanges. The script's abugida structure, featuring consonants with inherent vowels and diacritics, was tailored to record Austronesian Cham languages alongside Sanskrit, marking an early localization of foreign orthographic conventions.[2] The earliest known inscriptions employing the proto-Cham script appear in the Mỹ Sơn temple complex in central Vietnam, a major religious center of the Champa kingdom dating from approximately the 4th century CE, though some epigraphic evidence suggests activity as early as 300–350 CE.[6] These inscriptions, primarily in Sanskrit, served as foundational records of the kingdom's adoption of Hinduism, with the complex's stelae and temple bases hosting texts that document royal patronage and ritual dedications.[6] A notable precursor is the Võ Cạnh stele (Inscription C.40), dated to the 2nd–4th century CE and discovered in Khánh Hòa province, which represents the oldest Sanskrit epigraphy in Southeast Asia and illustrates the script's initial use for sacred purposes in proto-Champa contexts.[6] These early inscriptions played a crucial role in preserving Hindu religious texts, including hymns, myths, and invocations to deities like Shiva, while later examples incorporated Buddhist elements amid the kingdom's syncretic traditions.[6] Donative inscriptions on stelae, such as those detailing offerings and land grants at Mỹ Sơn, exemplify the script's function in legitimizing Champa rulers' authority through links to Indic cosmology and governance.[6] This epigraphic tradition underscores the Champa kingdom's cultural integration of Brahmic writing systems, facilitating the documentation of temple constructions and priestly lineages from the 4th century onward.[2]Evolution Through Influences
By the 8th century, the Cham script had evolved into a mature abugida, derived from the Pallava variant of the Brahmi script and adapted to accommodate the phonology of the Austronesian Cham language. This transformation involved modifications such as the representation of diphthongs and the truncation of certain consonant clusters to align with Cham's syllable structure, distinguishing it from its South Indian precursors while enabling the recording of both Sanskrit loanwords and vernacular texts in early inscriptions.[7][8] During the 9th to 15th centuries, interactions between Champa and the Khmer Empire exerted significant influence on the script's development, particularly through cultural exchanges and periods of Khmer occupation in Cham territories. Khmer script features, such as specific glyph forms for consonants like /na/ and /ma/, were incorporated, reflecting shared Indic roots and regional adaptations that enhanced the script's utility for bilingual inscriptions in Sanskrit and Old Cham. These influences peaked amid conflicts and alliances, including Khmer incursions into central Vietnam, which prompted Cham scribes to refine orthographic conventions for local phonemes absent in Khmer, such as certain Austronesian vowels.[9][8] The conquest of Champa's capital Vijaya by Vietnamese forces in 1471 marked a pivotal divergence in the script's trajectory, as the kingdom continued until its final fall in 1832, with surviving Cham communities splitting into eastern groups in Vietnam and western exiles in Cambodia and elsewhere, leading to variant forms shaped by isolation and external pressures. In western variants, Islamic conversions—accelerating from the 11th century but intensifying post-1471 through Malay trade networks—introduced Arabic-derived elements, evolving into Akhar Srak, a modified abugida blending Indic forms with Jawi influences for religious texts. Meanwhile, eastern Akhar Thrah retained more traditional features, though both underwent standardization between the 15th and 19th centuries to support literary production, including epic poetry like ariya and historical chronicles such as tapuk, which preserved Cham identity amid diaspora.[9][2]Colonial Impacts and Modern Adaptations
During the French colonial period in Indochina, authorities introduced Latin-based romanization systems for the Cham script to facilitate administrative and scholarly work, beginning with Étienne Aymonier's 1889 publication of the first such system in Grammaire de la langue Chame, developed in collaboration with Cham scholars.[2] Subsequent refinements by Antoine Cabaton in a 1906 Cham-French dictionary and Paul Mus in early 20th-century articles further adapted the script, but these efforts faced limitations, including omissions of nasal consonants and a focus on French phonetics that reduced broader applicability.[2] Adoption remained limited due to resistance from Cham leaders, who viewed romanization as a threat to cultural identity and the traditional script's obsolescence.[2] Following Vietnam's independence and unification in 1975, as well as Cambodia's post-colonial instability, efforts emerged to simplify and revive the Cham script through phonetic reforms. In Vietnam, the 1978 establishment of the Committee for Standardization of Cham Script in Phan Rang initiated standardization of the Akhar Thrah variant and first-language education programs.[7] Between 1979 and 1995, spelling reforms phonemicized the script and regularized it based on Ninh Thuận and Bình Thuận dialects to align more closely with spoken Cham, though these changes sparked controversy within communities over cultural authenticity.[7] In Cambodia, similar post-independence initiatives were hampered by ongoing political turmoil, with limited documented reforms until later stabilization. Vietnam's 1975 unification under communist rule and Cambodia's civil war (1970–1975) culminating in the Khmer Rouge regime (1975–1979) profoundly disrupted Cham script transmission, as assimilation policies and widespread violence targeted minority cultural practices.[2] In Vietnam, pre-1975 school teaching attempts were nonsystematic and poorly enrolled, while unification shifted emphasis to Vietnamese language dominance, eroding written Cham literacy.[7] The Cambodian civil war and Khmer Rouge era led to the destruction of numerous Cham manuscripts and suppression of ethnic expressions, forcing reliance on oral traditions for language and cultural preservation among surviving communities.[10] Post-2000 community-led initiatives have focused on reviving the script through education, particularly in Vietnam's Ninh Thuận province, where Cham language classes—incorporating the traditional script—have expanded significantly. By the 2019–2020 school year, 24 schools offered Cham instruction to 8,126 students across 288 classes, building on earlier programs to integrate the script into primary curricula for cultural continuity. As of 2025, efforts continue to expand, with provinces like Lâm Đồng offering Cham classes in 12 primary schools to 3,643 students across 142 classes.[11][12] These efforts, supported by local Cham boards and ongoing textbook development, aim to counter generational literacy loss while fostering bilingual proficiency.[13] In Cambodia, recent initiatives as of 2025 include cultural exhibitions to preserve the script and efforts toward Unicode encoding for Western Cham.[14][15]Varieties
Eastern Cham (Akhar Thrah)
Eastern Cham script, known as Akhar Thrah, serves as the primary writing system for the Eastern variety of the Cham language, spoken by approximately 138,000 people in the southern central Vietnamese provinces of Ninh Thuận and Bình Thuận (as of 2023).[16][17][7] This variety is concentrated among communities along the coastal regions, where the script is employed in religious texts, literature, and cultural documentation, reflecting the speakers' Hindu and animist traditions.[7] The script's phonemic adaptations include a set of 33 consonants tailored to Eastern Cham's sound system, which features distinctions such as implosives (/ɓ/, /ɗ/, /ʄ/) and retroflex consonants (e.g., /ʈ/, /ɖ/) absent in its Sanskrit origins.[18][19] These are represented by dedicated graphemes derived from Brahmic forms, with inherent vowels (typically /a/ for most consonants and /ɨ/ for nasals) modified by diacritics to match the language's nine vowel qualities and length contrasts.[16][20] For instance, the high central unrounded vowel /ɨ/ appears inherently after nasals and can involve nasalization, as in representations of words like mɨə ("to have"), where a tilde-like diacritic or contextual nasal consonant indicates the nasalized form /mɨ̃ə/.[19][16] Visually, Akhar Thrah exhibits rounded letterforms influenced by Khmer script, with characters hanging from a baseline in a left-to-right direction and no spaces between words, creating continuous text blocks.[20][2] This style facilitates compact inscription on stone, palm leaves, and modern prints, emphasizing aesthetic flow over segmentation. The script's structure supports the CV(C) syllable pattern of Eastern Cham, using subjoined forms for clusters and final consonants marked by extended strokes or dedicated signs.[20]Western Cham (Akhar Srak)
The Western Cham variety of the Cham language is spoken by approximately 318,000 people, with about 270,000 in Cambodia and 49,000 in southern Vietnam (as of 2023).[2][21][22] These speakers are concentrated in Cambodia's Kampong Cham province and Vietnamese provinces such as An Giang and Tay Ninh, where the community maintains distinct cultural practices tied to their linguistic heritage.[2][21] Akhar Srak, also known as Akhar Ka Kha, represents the script variant used by Western Cham communities, diverging historically from the broader Cham tradition following the widespread conversion to Islam in the 15th century.[23] This divergence occurred amid interactions with Muslim traders from the Arabian Peninsula and Malay regions, leading to the adoption of Islamic cultural elements that influenced script usage and manuscript production.[24] As an abugida derived from ancient Indic scripts, Akhar Srak features inherent vowels and consonant signs adapted to the Western dialect's phonology, including pronunciation shifts such as mid-word /ra/ rendered as /ga/ and initial /wa/ hardening to /va/.[2] The script often appears in manuscripts intermixed with Arabic elements, reflecting hybrid writing practices in religious and cultural contexts.[25] The Western Cham dialect incorporates numerous loanwords from Malay and Arabic, introduced through trade and religious exchanges, which have shaped its phonemic system by adding consonant clusters in polysyllabic forms not as prominent in other Chamic varieties.[26][15] Examples include Arabic terms like "Jibril" for Gabriel, integrated with local phonetic adjustments.[15] Since the 17th century, following further consolidation of Islamic practices between 1607 and 1676, Western Cham speakers have shown a strong preference for the Jawi script—an Arabic-based adaptation—for religious texts, while reserving Akhar Srak mainly for secular poetry, folktales, and community literature to preserve ethnic identity amid external influences.[27][28]Usage and Cultural Role
Traditional Applications
The Cham script traditionally served as the primary medium for recording Hindu-Buddhist mantras and sacred texts, preserving spiritual practices central to Cham identity. It was prominently featured in temple inscriptions, such as those at the Po Nagar complex dating to 965 AD, which document rituals honoring the goddess Po Nagar and other deities through engraved invocations and dedications on altars and stone structures. These inscriptions not only invoked divine protection but also outlined ceremonial protocols for worship, integrating the script into communal religious life.[1][4] In literature, the script facilitated the composition and transmission of epic poems, exemplified by the 19th-century Ariya Cam Bini, a narrative work exploring themes of love, morality, and Cham history. Manuscripts in the script housed such epics alongside historical chronicles and folklore, serving as repositories of cultural knowledge passed down through generations. This literary tradition underscored the script's role in articulating Cham worldview and societal values.[29][1] Educationally, the script was transmitted exclusively to boys beginning around age 12 in village settings, where instruction emphasized rote memorization and manual copying of texts to instill literacy and cultural continuity. This male-centric process, often conducted by elders or religious figures, paired script learning with recitation of the Cham zodiac, reinforcing its ties to ritual and daily observances. Girls were typically excluded from formal literacy, limiting access to written forms.[1][30] The script intertwined with oral traditions, appearing in folk songs that recited poetic verses and in storytelling performances, including shadow puppetry, where written narratives were dramatized to engage communities during festivals and rituals. Both Eastern and Western varieties supported these expressions, adapting to local dialects while maintaining a shared heritage.[1][4]Contemporary Challenges and Revival Efforts
In recent decades, literacy rates in the Cham script have remained low among youth, with studies from the 2000s indicating limited proficiency and high rates of forgetting after only 2 hours per week of instruction in select schools.[7] This decline is attributed to the overwhelming dominance of the Latin-based Vietnamese script (quốc ngữ) in formal education and media across Vietnam and Cambodia.[7] This shift has marginalized the traditional Brahmic-derived script, limiting its practical application and leading to widespread diglossia where spoken Cham coexists uneasily with Vietnamese or Khmer as the primary written languages.[7] Compounding these issues are broader socio-economic pressures, including rapid urbanization that exposes younger generations to mainstream linguistic norms and assimilation policies in Vietnam that prioritize national unity over ethnic minority scripts. In Cambodia, competition from the Khmer script further erodes Western Cham (Akhar Srak) usage, as ethnic Chams increasingly adopt Khmer for education, administration, and daily communication to integrate into the dominant society.[31][7] These factors have resulted in the script's near-absence from public domains, with many Chams reporting limited retention even after primary schooling due to insufficient instructional hours and materials.[32] Revival initiatives have gained momentum through governmental and non-governmental efforts. In Vietnam, the government's ethnic language programs, launched in the 2010-2011 academic year, incorporated Cham script instruction into curricula across 740 schools, serving over 110,000 students and marking a formal commitment to minority language preservation.[33] In Cambodia, NGOs have driven digital font development and mobile compatibility for the Cham script, enabling easier online production and dissemination of texts to counter technological barriers.[34] These projects, alongside community-led workshops, aim to standardize and modernize the script for contemporary use. More recently, as of 2023, cultural advocates like Leb Ke have worked to revive Cham poetry traditions, while by 2025, UNESCO reported growing enthusiasm among young Chams for language preservation efforts, alongside initiatives by religious groups such as the Kan Imam San sect promoting script teaching among Cambodian youth.[35][36][31] The script retains cultural vitality in festivals like Kate, where it features in ritual inscriptions, prayers, and ceremonial documents that honor ancestors and reinforce ethnic identity among participants.[37] Community media outlets and emerging online content on social platforms, including short educational videos and language classes shared by groups like the Khmer Institute for Young Development, are fostering renewed interest and accessibility, particularly among diaspora and urban youth.[38] Such grassroots digital efforts highlight the script's potential role in sustaining Cham heritage amid globalization.[39]Orthographic Structure
Consonant Inventory
The Cham script features 35 base consonant letters, each carrying an inherent vowel sound transcribed as /ă/ (a short central vowel) unless modified by dependent vowel signs. These consonants are organized in the traditional varga (class) order derived from Brahmic scripts, progressing from gutturals (velars) through palatals, dentals, retroflexes, and labials, followed by semivowels, sibilants, and aspirates. This arrangement reflects the script's historical roots in ancient Indian writing systems while adapting to the phonology of the Cham language, an Austronesian tongue spoken primarily in Vietnam and Cambodia.[40] To accommodate Cham-specific phonemes, the inventory includes pairs of aspirated and unaspirated stops (e.g., /k/ vs. /kʰ/, /p/ vs. /pʰ/), as well as dedicated letters for nasals like /ɲ/ (palatal) and /ŋ/ (velar), which are essential for distinguishing words in the language's syllable structure. Nasals typically bear a distinct inherent vowel /ɨ/ or /ə/ rather than /ă/, aiding in phonetic clarity. The full set of base consonants, with their approximate IPA values and Unicode codepoints, is presented below in varga order:| Varga Class | Letter | Codepoint | IPA | Example Glyph |
|---|---|---|---|---|
| Gutturals | KA | U+AA06 | /k/ | ꨆ |
| KHA | U+AA07 | /kʰ/ | ꨇ | |
| GA | U+AA08 | /ɡ/ | ꨈ | |
| GHA | U+AA09 | /ɡʱ/ | ꨉ | |
| NGUE | U+AA0A | /ŋʷ/ | ꨊ | |
| NGA | U+AA0B | /ŋ/ | ꨋ | |
| Palatals | CHA | U+AA0C | /c/ | ꨌ |
| CHHA | U+AA0D | /cʰ/ | ꨍ | |
| JA | U+AA0E | /ɟ/ | ꨎ | |
| JHA | U+AA0F | /ɟʱ/ | ꨏ | |
| NHUE | U+AA10 | /ɲʷ/ | ꨐ | |
| NHA | U+AA11 | /ɲ/ | ꨑ | |
| NHJA | U+AA12 | /ɲɟ/ | ꨒ | |
| Dentals | TA | U+AA13 | /t̪/ | ꨓ |
| THA | U+AA14 | /t̪ʰ/ | ꨔ | |
| DA | U+AA15 | /d̪/ | ꨕ | |
| DHA | U+AA16 | /d̪ʱ/ | ꨖ | |
| NUE | U+AA17 | /nʷ/ | ꨗ | |
| NA | U+AA18 | /n/ | ꨘ | |
| Retroflex | DDA | U+AA19 | /ɖ/ | ꨙ |
| Labials | PA | U+AA1A | /p/ | ꨚ |
| PPA | U+AA1B | /p/ | ꨛ | |
| PHA | U+AA1C | /pʰ/ | ꨜ | |
| BA | U+AA1D | /b/ | ꨝ | |
| BHA | U+AA1E | /bʱ/ | ꨞ | |
| MUE | U+AA1F | /mʷ/ | ꨟ | |
| MA | U+AA20 | /m/ | ꨠ | |
| BBA | U+AA21 | /ɓ/ | ꨡ | |
| Semivowels & Others | YA | U+AA22 | /j/ | ꨢ |
| RA | U+AA23 | /r/ | ꨣ | |
| LA | U+AA24 | /l/ | ꨤ | |
| VA | U+AA25 | /v/ | ꨥ | |
| SSA | U+AA26 | /s/ | ꨦ | |
| SA | U+AA27 | /s/ | ꨧ | |
| HA | U+AA28 | /h/ | ꨨ |
Vowel Representation
The Cham script utilizes a combination of independent vowel letters and dependent vowel signs to denote its vowel phonemes, allowing for precise representation in syllable-initial positions or as modifications to consonant-inherent vowels. Independent vowels typically appear at the beginning of words or in isolation, while dependent signs attach to consonant bases—such as those detailed in the consonant inventory—to supplant the default inherent vowel, which is /ă/ for most consonants and /ɨ/ or /ə/ for nasals in Eastern Cham. This system reflects the script's Brahmic heritage, adapted to the Austronesian phonology of the Cham language.[40][41] There are six independent vowel letters, each representing a core vowel sound often realized with an initial glottal stop (ʔ) in Eastern Cham: ꨀ for /ʔa/, ꨁ for /ʔi/, ꨂ for /ʔu/, ꨃ for /ʔɛ/, ꨄ for /ʔai/, and ꨅ for /ʔo/. These forms originated from Sanskrit vowel akṣaras but underwent simplification in Cham, reducing redundancy and aligning with native vowel contrasts; for instance, the independent /ʔi/ (ꨁ) can also be composed using the letter A (ꨀ) plus a dependent sign in some contexts.[40][41] Dependent vowel signs consist of ten diacritics in the standardized encoding, positioned relative to the base consonant to indicate non-inherent vowels: post-consonant forms include ꨩ for /aː/, ꨪ for /i/, ꨫ for /iː/, ꨬ for /ei/, ꨭ for /u/, ꨮ for /ə/, ꨰ for /ai/, ꨱ for /əː/, and ꨲ for /ɨ/; the sign ꨯ for /o/ may appear pre- or post-consonant. These signs override the inherent vowel, enabling combinations like ꨆꨪ for /ki/ (where ꨆ is /k/). The inventory supports the language's eight to ten vowel qualities, depending on dialect.[18][41][40] Diphthongs are represented either independently (e.g., ꨄ for /ai/) or via dependent signs (e.g., ꨰ for /ai/, ꨬ for /ei/), with rendering rules ensuring proper stacking or linearization in complex syllables. Dialectal variations, particularly in Eastern versus Western Cham, influence vowel length distinctions, such as short /a/ (inherent) versus long /aː/ (marked by ꨩ), where nasal environments may shift short /a/ to /ɨ/ without additional marking; long forms like /uː/ are similarly derived by combining short /u/ (ꨭ) with ꨩ.[41][40] In the Unicode standard, independent vowels are assigned to the range U+AA00–U+AA05, while the dependent signs occupy U+AA29–U+AA32, facilitating digital implementation and consistent rendering across fonts.[18]Medial and Final Consonants
In the Cham script, medial consonants represent sounds occurring between the initial consonant and vowel within a syllable, forming consonant clusters such as /kr/ or /kl/. For Eastern Cham (Akhar Thrah), there are four primary medial consonant diacritics, which are combining marks placed as subscripts below the initial consonant: ꨳ for /j/ (ya), ꨴ for /r/ (ra), ꨵ for /l/ (la), and ꨶ for /w/ (wa).[41][18] These are encoded in Unicode at U+AA33 to U+AA36 and can combine in sequences like -rj- or -rw- to indicate more complex clusters, as in ꨆꨴꨩ (kraː).[41] In Western Cham (Akhar Srak), medial representation expands to include additional subjoined forms such as signs for /ɣ/ (r), /l/, /j/ (y), and /v/, often arising from vowel deletion in unstressed syllables, resulting in clusters like [kl] or [kɣ].[15] These are proposed for Unicode encoding as distinct combining characters to handle dialectal variations.[15] Final consonants, which form the syllable coda in Cham's CV(C) structure, are not indicated by a virama (vowel killer) as in many Brahmic scripts; instead, Eastern Cham employs a set of explicit final consonant letters or combining diacritics attached to the preceding vowel-bearing form.[40][42] Common final forms include ꩀ (-k), ꩁ (-g), ꩂ (-ŋ), ꩅ (-t), ꩆ (-n), ꩇ (-p), ꩉ (-r), ꩊ (-l), ꩌ (-m), and ꩍ (-h), totaling around ten frequently used endings that account for most native codas, with the remainder using less common letters like ꩄ (-ch) or ꩈ (-y).[41][18] These are encoded primarily in Unicode's Cham block at U+AA40–U+AA4D, where final letters (U+AA40–U+AA4B) appear as standalone reduced shapes, and combining signs like ꩃ (final -ŋ at U+AA43) or ꩌ (final -m at U+AA4C) attach below.[18] An example is ꨀꨪꩆ (ʔin, "day").[41] In Western Cham, final consonants similarly use explicit reduced forms, such as FINAL K, FINAL NG, FINAL T, FINAL P, and FINAL M, but with dialect-specific extensions like FINAL B or FINAL V to capture phonetic variations, including lengthened vowels or implosives.[15] The cursive handwriting style prevalent in Western Cham manuscripts often renders clusters and finals illegible, as strokes blend, leading to ambiguities like distinguishing FINAL B from other marks, which requires contextual interpretation.[15] Phonetic simplifications are common, such as nasal assimilation where a final nasal merges with a following consonant (e.g., /ŋk/ realized as [ŋk] or simplified to [ŋ]), or replacement of final /r/ with vowel lengthening in some dialects.[15] These features highlight the script's adaptation to spoken Cham's syllable structure while maintaining Brahmic orthographic principles without virama suppression.[40]Numerals and Additional Symbols
The Cham script employs a set of ten distinct numerals to represent the digits from 0 to 9, primarily used in historical inscriptions for recording dates, quantities, and other numerical data. These numerals follow the Brahmic tradition and appear in traditional Cham texts alongside the alphabetic characters. Representative examples include the numeral for zero (꩐), one (꩑), five (꩕), and nine (꩙).[43]| Digit | Numeral |
|---|---|
| 0 | ꩐ |
| 1 | ꩑ |
| 2 | ꩒ |
| 3 | ꩓ |
| 4 | ꩔ |
| 5 | ꩕ |
| 6 | ꩖ |
| 7 | ꩗ |
| 8 | ꩘ |
| 9 | ꩙ |
Digital Encoding
Unicode for Eastern Cham
The Eastern Cham script was allocated in Unicode Standard version 5.1, released on April 4, 2008, within the dedicated Cham block spanning U+AA00 to U+AA5F in the Basic Multilingual Plane. This block encompasses 83 assigned characters, including 33 consonant forms (initial, medial, and final), 6 independent vowels, 17 dependent vowel marks, 10 digits, and various combining marks and symbols essential for representing the script's abugida structure.[18] The encoding prioritizes the Eastern variant used by Cham communities in Vietnam and Cambodia, reflecting its Brahmi-derived orthography without a virama for consonant clusters. Rendering Eastern Cham text demands support for complex script shaping, as it follows Indic-like clustering where base consonants combine with above, below, left, or right vowel signs and matras. OpenType font features, including glyph reordering (e.g., via the 'akhn' and 'rphf' tables for akhand ligatures and reph forms) and positioning (e.g., 'blwf' for below-base forms and 'pstf' for post-base forms), are required to correctly display vowel diacritics and stacked elements within syllables. The Unicode algorithm processes input in logical order, with rendering engines like HarfBuzz or Uniscribe applying these features to produce visual forms, ensuring proper stacking and ligation without explicit virama usage. Font support for Eastern Cham has expanded since the encoding's introduction, with Google's Noto Sans Cham font providing comprehensive coverage of the block since its release in 2013 as part of the Noto project aimed at universal script harmonization. This font, along with variants in the Noto family, incorporates the necessary OpenType tables for accurate rendering and is freely available for digital applications. In practical use, Eastern Cham Unicode is integrated into mobile keyboards, such as the Keyman Eastern Cham layout, which enables input on Android and iOS devices for Vietnamese Cham users through phonetic mapping and script conversion.[45] Additionally, custom fonts like EFEO Cam Times, developed to Unicode standards, support educational and publishing tools in Vietnam.[46] Eastern Cham encoding maintains compatibility with Latin-based transliterations, commonly used in scholarly and input contexts, through Unicode Normalization Form C (NFC), which canonicalizes decomposed diacritics into composed forms where applicable, facilitating seamless conversion in digital workflows.[47] This normalization ensures that Latin representations of Cham sounds, such as those in the EFEO system, can be reliably mapped to native script output without loss of equivalence in processing.[47]Encoding Status for Western Cham
The encoding of the Western Cham script, used primarily in Cambodia, has progressed through multiple proposals since its initial submission to the Unicode Technical Committee (UTC) in 2016 by Michael Everson and Andrew Cunningham.[48] Subsequent revisions, including a key document in 2019 by Martin Hosken and further updates in 2020 and 2022, sought to standardize approximately 87 characters in a dedicated block.[49][50][15] However, as of November 2025, the script remains in a tentative allocation within the Supplementary Multilingual Plane (SMP) roadmap at U+1E200–U+1E2FF, without full encoding in the Unicode Standard. As of Unicode 17.0 (September 2025), Western Cham remains unencoded, with inclusion anticipated in a future version around 2026.[51] Unlike the established Eastern Cham block (U+AA00–U+AA5F), which supports the Vietnamese variant, the Western Cham proposal requires over 80 additional characters to accommodate unique features such as cursive joining ligatures (e.g., three special forms at proposed U+1E23A–U+1E23C) and hybrid forms integrating Arabic influences, including an extension at U+061D.[50] These additions address the script's divergence, including attached vowel symbols and subscript consonants specific to Cambodian usage. Proposals have faced stability challenges due to variant forms, such as two literary vowel alternatives and stylistic font variations (e.g., at proposed U+1E235), leading to ongoing refinements to ensure community consensus and avoid disunification issues.[50][15] In the absence of official encoding, Western Cham text is often handled through workarounds like mapping characters to the Private Use Area (PUA) in Unicode (e.g., U+E000–U+F8FF) or relying on Latin-based transliterations in digital tools and software. Font development remains limited, with efforts primarily led by Cambodian linguists and international collaborators to create custom typefaces compatible with existing systems, though these lack broad interoperability.[52][39] Looking ahead, inclusion in the Unicode Standard is anticipated contingent on UTC review and finalization expected around 2026, which would enable better digital preservation and usage for the Western Cham community.[51][53]Illustrative Examples
Sample Text in Eastern Cham
A traditional Eastern Cham text from a 19th-century manuscript is an excerpt from the Damnây Po Inâ Nâgar, an ode-like narrative hymn honoring the mother goddess Po Inâ Nâgar, central to Cham Hindu religious practices. This genre blends poetic praise with mythological storytelling, often recited in rituals at temples like Po Nagar in Nha Trang, Vietnam. The following short excerpt illustrates the script's full orthography, including independent and dependent vowels, consonant clusters, and the danda (।) as a verse separator. For illustration, a standard sample proverb is provided below, as specific manuscript excerpts require paleographic verification beyond standard encoding.[54] Eastern Cham Script Excerpt:ꨤꨨꨪꩀ ꨎꨳꨯꨮꩆ ꨕꨴꨭꩅ ꨕꨴꨭꩈ ꨨꨕꨯꩌ ꨨꨣꨬ
ꨤꨨꨪꩀ ꨧꨝꩅ ꨕꨭꩀ ꨕꨭꩈ ꨨꨕꨯꩌ ꨝꨪꨤꨩꩆ
ꨤꨨꨪꩀ ꨚꨢꨯꨩ ꨚꨕꨴꨭꩅ ꨚꨕꨴꨭꩈ ꨨꨕꨯꩌ ꨔꨭꩆ
ꨤꨨꨪꩀ ꨀꨟꨯꨮꩀ ꨧꨭꨂꩀ ꨧꨭꨂꨅꩆ ꨨꨩ ꨂꨟꨯꩉ Transliteration:
Lahik jiên; drut druy hadôm harei
Lahik sabat; duk duy hadôm bilaan
Lahik payô; padrut padruy hadôm thun
Lahik Amêk; su-uk su-uôn ha umôr English Translation:
Loss of money; Sad for a few days.
Loss of friends; Sad for a few months.
Loss of girlfriend; Sad for a few years.
Loss of mother; Sad for life. This proverb-like text demonstrates everyday language usage, though religious texts like the Damnây follow similar orthographic conventions.[41] Annotation of Key Features:
- The word "Lahik" (loss) uses ꨤ (la, U+AA24) with dependent vowels and finals, showing abugida stacking.
- Vowel modifications, such as ꩀ (final k, U+AA40) and ꩆ (aa, U+AA46), illustrate length and quality distinctions adapted for Austronesian sounds.
- Consonant clusters like "drut" employ ꨎ (da, U+AA0E) with ꨳ (ru, U+AA33) and dependent forms.
- The danda (।, U+0964, shared from Devanagari but used in Cham) separates lines, reflecting Brahmic heritage in religious and poetic contexts. These elements highlight the script's adaptation for Eastern Cham phonology.[18]
Sample Text in Western Cham
A representative verse from a love poem in Western Cham, transcribed from Cambodian oral traditions, illustrates the script's cursive connections and incorporation of Arabic loanwords in secular literature. Such poems, part of genres like ariya (lyrical narratives often exploring romantic themes), blend local expressions of affection with Islamic influences to evoke enduring bonds. An example phrase from an ariya appears below. As Western Cham (Akhar Srak) is not yet encoded in Unicode as of 2025, the sample is provided in ALA-LC transliteration.[9][15] Western Cham Transliteration:ngap dher pataow pa-abih takai karai – il limo patih ka ber thuw hai English Translation:
(From Ariya Ppo Riyak) A rough translation captures themes of longing and separation in love: "Take the path of the abandoned one – a thousand hardships to meet again." This rendering highlights the script's lack of inter-word spaces, requiring contextual reading, in contrast to modern printed forms. Hybrid punctuation, combining Indic danda with Arabic-inspired diacritics, structures rhythm. Arabic loanwords like salam (peace) often infuse romantic pleas with spiritual depth, reflecting synthesis of influences during the 17th–19th centuries under figures like Po Romé.[9] In breakdown (based on orthographic description): The initial consonants connect fluidly in cursive ligatures; angular forms distinguish Western from Eastern variants, with elongated finals for poetic emphasis.[15]