Tocharian script
View on Wikipedia| Tocharian script | |
|---|---|
Kizil Caves standing Buddha. Often attributed in the past to the 7th century AD,[1] but now carbon dated to AD 245-340.[2] Tocharian B inscription reading: Se pañäkte saṅketavattse ṣarsa papaiykau "This Buddha was painted by the hand of Sanketava".[3][4][5][6] | |
| Script type | |
Period | 8th century |
| Languages | Tocharian languages |
| Related scripts | |
Parent systems | |
Sister systems | Gupta, Tamil-Brahmi, Bhattiprolu, Sinhala |
| Brahmic scripts |
|---|
| The Brahmi script and its descendants |
The Tocharian script,[7] also known as Central Asian slanting Gupta script or North Turkestan Brāhmī,[8] is an abugida which uses a system of diacritical marks to associate vowels with consonant symbols. Part of the Brahmic scripts, it is a version of the Indian Brahmi script. It is used to write the Central Asian Indo-European Tocharian languages, mostly from the 8th century (with a few earlier ones, probably as early as AD 300)[9] that were written on palm leaves, wooden tablets and Chinese paper, preserved by the extremely dry climate of the Tarim Basin. Samples of the language have been discovered at sites in Kucha and Karasahr, including many mural inscriptions. Mistakenly identifying the speakers of this language with the Tokharoi people of Tokharistan (the Bactria of the Greeks), early authors called these languages "Tocharian". This naming has remained, although the names Agnean and Kuchean have been proposed as a replacement.[10][7]
Tocharian A and B are not mutually intelligible. Properly speaking, based on the tentative interpretation of twqry as related to Tokharoi, only Tocharian A may be referred to as Tocharian, while Tocharian B could be called Kuchean (its native name may have been kuśiññe), but since their grammars are usually treated together in scholarly works, the terms A and B have proven useful. A common Proto-Tocharian language must precede the attested languages by several centuries, probably dating to the 1st millennium BC. Given the small geographical range of and the lack of secular texts in Tocharian A, it might alternatively have been a liturgical language, the relationship between the two being similar to that between Classical Chinese and Mandarin. However, the lack of a secular corpus in Tocharian A is by no means definite, due to the fragmentary preservation of Tocharian texts in general.
History
[edit]The Tocharian script is derived from the Brahmi alphabetic syllabary (abugida) and is referred to as slanting Brahmi. It soon became apparent that a large proportion of the manuscripts were translations of known Buddhist works in Sanskrit and some of them were even bilingual, facilitating decipherment of the new language. Besides the Buddhist and Manichaean religious texts, there were also monastery correspondence and accounts, commercial documents, caravan permits, and medical and magical texts, and one love poem. Many Tocharians embraced Manichaean duality or Buddhism.
In 1998, Chinese linguist Ji Xianlin published a translation and analysis of fragments of a Tocharian Maitreyasamiti-Nataka discovered in 1974 in Yanqi.[11][12][13]
The Tocharian script probably died out after 840, when the Uyghurs were expelled from Mongolia by the Kyrgyz, retreating to the Tarim Basin. This theory is supported by the discovery of translations of Tocharian texts into Uyghur. During Uyghur rule, the peoples assimilated by[clarification needed] the Turkic speaking Uyghurs now in Xinjiang.[citation needed]
Script
[edit]The Tocharian script is based on Brahmi, with each consonant having an inherent vowel, which can be altered by adding a vowel mark or removed by a special nullifying mark, the virama. Like Brahmi, Tocharian uses stacking for conjunct consonants and has irregular conjunct forms of
, ra.[14] Unlike other Brahmi scripts, Tocharian has a second set of characters called Fremdzeichen that double up several of the standard consonants, but with an inherent "Ä" vowel.[15] The eleven Fremdzeichen are most often found as substitutes for the standard consonant+virama in conjuncts, but they can be found in any context other than with the explicit "Ä" vowel mark. Fremdzeichen as consonant+virama is not found in later Tocharian texts.
Table of Tocharian letters
[edit]| Independent | A | Ā | I | Ī | U | Ū |
|---|---|---|---|---|---|---|
| R̥ | R̥̄ | E | Ai | O | Au | Ä |
| Vowel diacritics (here applied on as an example) |
Tha | Thā | Thi | Thī | Thu | Thū |
| Thr̥ | Thr̥̄ | The | Thai | Tho | Thau | Thä |
| Velars | Ka | Kha | Ga | Gha | Ṅa |
|---|---|---|---|---|---|
| Standard | |||||
| Fremdzeichen | |||||
| Palatals | Ca | Cha | Ja | Jha | Ña |
| Retroflexes | Ṭa | Ṭha | Ḍa | Ḍha | Ṇa |
| Dentals | Ta | Tha | Da | Dha | Na |
| Standard | |||||
| Fremdzeichen | |||||
| Labials | Pa | Pha | Ba | Bha | Ma |
| Standard | |||||
| Fremdzeichen | |||||
| Sonorants | Ya | Ra | La | Va | |
| Standard | |||||
| Fremdzeichen | |||||
| Sibilants | Śa | Ṣa | Sa | Ha | |
| Standard | |||||
| Fremdzeichen |
| Visarga | Anusvara | Virama (on |
Jihvamuliya | Upadhmaniya |
Evolution from Brahmi to Tocharian
[edit]
Manuscripts in Sanskrit, using Middle Brahmi script and the Kushan period, and carbon dated to the 2nd century CE, have been discovered in the Tarim Basin, and particularly at Kizil. Some of the fragments, quite possibly the oldest Sanskrit manuscript of any type related to Buddhism and Hinduism discovered so far, were discovered in 1906 in the form of a pile of more than 1,000 palm leaf fragments in the Ming-oi, Kizil Caves, during the third Turfan expedition headed by Albert Grünwedel. The calibrated age of the manuscript by Carbon-14 technique is 130 CE (80–230 CE), corresponding to the rule of the Kushan king Kanishka.
The Tocharian script evolved from the Middle Brahmi script of the Kushan Empire:[16]
| a | i | u | e | o | k- | kh- | g- | gh- | ṅ- | c- | ch- | j- | jh- | ñ- | ṭ- | ṭh- | ḍ- | ḍh- | |
| Brahmi | 𑀅 | 𑀇 | 𑀉 | 𑀏 | 𑀑 | 𑀓 | 𑀔 | 𑀕 | 𑀖 | 𑀗 | 𑀘 | 𑀙 | 𑀚 | 𑀛 | 𑀜 | 𑀝 | 𑀞 | 𑀟 | 𑀠 |
| Kushan Brahmi | |||||||||||||||||||
| Tocharian |
| ṇ- | t- | th- | d- | dh- | n- | p- | ph- | b- | bh- | m- | y- | r- | l- | v- | ś- | ṣ- | s- | h- | |
| Brahmi | 𑀡 | 𑀢 | 𑀣 | 𑀤 | 𑀥 | 𑀦 | 𑀧 | 𑀨 | 𑀩 | 𑀪 | 𑀫 | 𑀬 | 𑀭 | 𑀮 | 𑀯 | 𑀰 | 𑀱 | 𑀲 | 𑀳 |
| Kushan Brahmi | |||||||||||||||||||
| Tocharian |
Unicode
[edit]References
[edit]- ^ Härtel, Herbert; Yaldiz, Marianne; Kunst (Germany), Museum für Indische; N.Y.), Metropolitan Museum of Art (New York (1982). Along the Ancient Silk Routes: Central Asian Art from the West Berlin State Museums : an Exhibition Lent by the Museum Für Indische Kunst, Staatliche Museen Preussischer Kulturbesitz, Berlin, Federal Republic of Germany. Metropolitan Museum of Art. p. 107. ISBN 978-0-87099-300-8.
- ^ Waugh (Historian, University of Washington), Daniel C. "MIA Berlin: Turfan Collection: Kizil". depts.washington.edu.
- ^ Härtel, Herbert; Yaldiz, Marianne; Kunst (Germany), Museum für Indische; N.Y.), Metropolitan Museum of Art (New York (1982). Along the Ancient Silk Routes: Central Asian Art from the West Berlin State Museums : an Exhibition Lent by the Museum Für Indische Kunst, Staatliche Museen Preussischer Kulturbesitz, Berlin, Federal Republic of Germany. Metropolitan Museum of Art. p. 107. ISBN 978-0-87099-300-8.
- ^ Le Coq, Albert von. Die Buddhistische Spätantike in Mittelasien : vol.5. p. 10.
- ^ "A dictionary of Tocharian B". www.win.tue.nl.
- ^ In Ashokan Brahmi: 𑀲𑁂𑀧𑀜𑀓𑁆𑀢𑁂 𑀲𑀡𑁆𑀓𑁂𑀢𑀯𑀝𑁆𑀲𑁂 𑀱𑀭𑁆𑀲 𑀧𑀧𑁃𑀬𑁆𑀓𑁅
- ^ a b Diringer, David (1948). Alphabet A Key To The History Of Mankind. pp. 347–348.
- ^ "BRĀHMĪ – Encyclopaedia Iranica". www.iranicaonline.org. Retrieved 2019-03-28.
- ^ Earliest paintings from Kizil Caves with Tocharian inscriptions, now carbon dated to AD 245-340, see Waugh (Historian, University of Washington), Daniel C. "MIA Berlin: Turfan Collection: Kizil". depts.washington.edu.
- ^ Namba Walter, Mariko (October 1998). "Tokharian Buddhism in Kucha: Buddhism of Indo-European Centum Speakers in Chinese Turkestan before the 10th Century C.E." (PDF). Sino-Platonic Papers. 85: 2-4.
- ^ "Fragments of the Tocharian", Andrew Leonard, How the World Works, Salon.com, January 29, 2008
- ^ "Review of 'Fragments of the Tocharian A Maitreyasamiti-Nataka of the Xinjiang Museum, China. In Collaboration with Werner Winter and Georges-Jean Pinault by Ji Xianlin'", J. C. Wright, Bulletin of the School of Oriental and African Studies, University of London, Vol. 62, No. 2 (1999), pp. 367–370
- ^ "Fragments of the Tocharian a Maitreyasamiti-Nataka of the Zinjiang Museum, China", Ji Xianlin, Werner Winter, Georges-Jean Pinault, Trends in Linguistics, Studies and Monographs
- ^ Gippert, Jost. "Tocharian Brahmi Script". TITUS Didactica. Retrieved 8 May 2013.
- ^ Wilson, Lee. "Proposal to Encode the Tocharian Script (in the Unicode Standard / ISO 10646)" (PDF). Retrieved 2020-06-09.
- ^ Joshi, R. Malatesha; McBride, Catherine (11 June 2019). Handbook of Literacy in Akshara Orthography. Springer. p. 27. ISBN 978-3-030-05977-4.
- ^ Diringer, David (1948). Alphabet A Key To The History Of Mankind. pp. 247–248.
- ^ "ScriptSource - Entry - Unicode Status (Tocharian)".
External links
[edit]- TITUS: Tocharian alphabets, conjugation tables, and manuscripts from the Berlin Turfan Collection
- A Tocharian-to-English dictionary with nearly 200 words with accompanying article
- Tocharian Online by Todd B. Krause and Jonathan Slocum, University of Texas at Austin
- Tocharian alphabet at Omniglot.com
- Online: "A dictionary of Tocharian B". www.win.tue.nl., also in print: Adams, Douglas Q. (2013). A Dictionary of Tocharian B. Rodopi. ISBN 978-90-420-3671-0.
- CEToM: A Comprehensive Edition of Tocharian Manuscripts
Tocharian script
View on GrokipediaBackground
Linguistic and historical context
The Tocharian languages represent an extinct branch of the Indo-European language family, distinct from the more familiar Indo-Iranian and Greco-Roman branches, and are known primarily through two dialects: Tocharian A, also called Turfanian or Agnean, and Tocharian B, known as Kuchean.[4] These dialects were spoken by communities in the oases of the Tarim Basin, located in present-day Xinjiang, northwest China, with manuscripts dating from around the 4th to the 13th centuries CE; Tocharian B is attested from before 400 CE to ca. 1200 CE, while Tocharian A dates from the 7th to 10th centuries CE.[5][2] The arid climate of the region preserved over 7,600 manuscripts and fragments, providing the primary evidence for these languages, which exhibit centum characteristics aligning them more closely with Western Indo-European branches like Celtic and Italic.[4][2] In the historical context of Central Asia, Tocharian speakers inhabited key nodes along the Silk Road trade routes, facilitating extensive cultural and linguistic exchanges in the Tarim Basin from the 1st millennium BCE onward.[5] The region served as a crossroads where Buddhism, originating from India, flourished from the 2nd century CE, profoundly influencing Tocharian society through the establishment of monasteries and the translation of sacred texts.[5] Interactions with neighboring cultures were multifaceted: Indo-Aryan influences arrived via Sanskrit and Gāndhārī loanwords related to Buddhism (e.g., bodhisātve 'bodhisattva'), Iranian elements through Saka and Sogdian contacts (e.g., etswe 'mule' from Old Iranian), and Chinese impacts via trade and administrative ties under dynasties like the Tang.[4] These exchanges enriched Tocharian vocabulary while maintaining its core Indo-European structure, highlighting the basin's role as a melting pot of Eurasian civilizations.[5] The Tocharian script, derived from the Brahmi family of Indian origin, played a crucial role in documenting a diverse corpus that belied the language's European linguistic affinities.[4] It was employed to record predominantly Buddhist literature, including sutras, commentaries, and monastic hymns, alongside administrative documents such as caravan passes and contracts, and occasional secular texts like poetry and medical treatises.[5] This orthographic choice reflected the deep penetration of Indo-Aryan cultural practices through Buddhist dissemination, contrasting sharply with Tocharian's phonological and grammatical ties to non-Asiatic Indo-European languages.[4] The decline and extinction of Tocharian occurred around the 9th to 10th centuries CE, accelerated by waves of Turkic migrations into the Tarim Basin, including the Uyghur influx around 840 CE and subsequent Karakhanid expansions.[6] Under Tibetan rule from the late 8th century and later Turkic dominance, Tocharian lost its status as a prestige language, leading to bilingualism and gradual shift toward Old Turkic, with the final manuscripts appearing by the 11th to 13th centuries before complete language death.[6][2] This process was compounded by the spread of Islam and the suppression of Buddhism, erasing Tocharian from the linguistic landscape of the region.[6]Discovery and decipherment
The discovery of Tocharian manuscripts primarily occurred during early 20th-century archaeological expeditions in the Tarim Basin, particularly the German Turfan expeditions conducted between 1902 and 1914. These expeditions, led by Albert Grünwedel and Albert von Le Coq, explored sites in the Turfan oasis, Kucha, and surrounding regions of Xinjiang, uncovering thousands of ancient documents amid the ruins of Buddhist monasteries.[7] The teams recovered over 40,000 fragments in total, including significant numbers written in the Tocharian script on materials such as paper, birch bark, and wood slips, many of which were Buddhist texts preserved in arid cave conditions.[8] The decipherment of the Tocharian script and languages began shortly after these finds reached European collections, with pivotal contributions from German scholars Emil Sieg and Wilhelm Siegling. In 1908, they published an article titled "Tocharisch," in which they identified the unknown language as Indo-European based on grammatical analysis and recognition of familiar roots, distinguishing two dialects: Tocharian A (from the Turfan region) and Tocharian B (from Kucha).[2] Their breakthrough relied heavily on bilingual texts, such as Sanskrit-Tocharian manuscripts like the Udānavarga, where parallel passages allowed for comparative translation and script interpretation. Additional support came from Tocharian-Chinese bilinguals, which provided contextual clues despite linguistic barriers. Sieg and Siegling's subsequent multi-volume work, Tocharische Sprachreste (1921–1953), presented transliterations, facsimiles, and glossaries of key fragments, solidifying the decipherment despite the challenges posed by the highly fragmented state of the papyri and birch-bark manuscripts, many of which were incomplete or damaged by age and environmental exposure.[9] Today, the major collections of Tocharian manuscripts are housed in institutions such as the Berlin State Library, which holds approximately 4,000 fragments from the Turfan expeditions, and the British Library, with over 800 digitized items.[10][11] In total, more than 7,600 Tocharian documents have been cataloged worldwide, enabling ongoing scholarly access through digitization projects.[2]Development
Origins in Brahmi script
The Brahmi script originated in ancient India around the 3rd century BCE, as evidenced by the rock edicts of Emperor Ashoka, marking its earliest attested use in Prakrit inscriptions across the subcontinent.[12] This abugida system, characterized by syllabic consonants with inherent vowels, served as the foundational writing tradition for numerous Indo-Aryan languages and spread northward with the expansion of Buddhism.[13] The transmission of Brahmi to Central Asia occurred primarily through the Kushan Empire (1st–3rd century CE), a multicultural realm that facilitated cultural exchanges along the Silk Road from northwestern India to Bactria and beyond.[14] Kushan rulers, such as Kanishka, promoted Buddhism and employed Brahmi alongside other scripts like Kharoshthi for administrative and religious purposes, including Sanskrit and Gandhari Prakrit inscriptions in regions like Mathura and Gandhara.[15] The script reached the Tarim Basin by the 2nd century CE via Kushan intermediaries, as evidenced by the Spitzer manuscript (ca. 130 CE), a Sanskrit philosophical text in Kushan Brahmi found at Kizil, introducing Brahmi for Buddhist and other documents in local multilingual environments.[16] In the Tarim Basin, initial adaptations of Brahmi for the Tocharian languages preserved core features, such as the left-to-right direction, while developing cursive forms suited to writing on palm leaves, wood, and later paper.[2] The system incorporated more explicit notation of vowels through diacritics and the addition of specific signs (Fremdzeichen) for Tocharian phonemes like /ä/ and certain consonants, somewhat reducing reliance on the inherent vowel typical of abugidas.[17] This evolution is evident in the North Turkestan Brahmi variant, also known as Slant Brahmi, which incorporated stacked akṣaras and a bar virama for consonant clusters, reflecting phonetic needs distinct from Indian Prakrit.[17] Earliest evidence of Brahmi in the Tarim region includes the Spitzer manuscript from the 2nd century CE, stylistically linked to Kushan Brahmi variants, found in oasis sites like Kizil and predating full Tocharian attestation around the 5th century CE.[18] These precursors, often in Prakrit or Sanskrit, demonstrate the script's establishment before its localization for Tocharian A and B dialects.[2]Evolution and regional adaptations
The Tocharian script evolved through distinct chronological phases after its adaptation in the Tarim Basin. The initial "Slant Brahmi" phase, also termed Tarim Gupta, emerged in the 4th–5th centuries CE, deriving from Indian Gupta script and primarily used for Sanskrit texts alongside early Tocharian B inscriptions.[19] By the 5th–6th centuries CE, transitional forms known as Early Tarim Brahmi A and B developed, reflecting initial local modifications for Tocharian phonology while retaining Brahmi's abugida structure.[19] The mature phase from the 6th–8th centuries CE introduced cursive styles, evident in both calligraphic Buddhist manuscripts and secular documents on northern and southern Tarim routes, with increased fluidity in letter forms to accommodate faster writing.[20] In the late 8th–10th centuries CE, the "Late Tocharian" phase featured simplified strokes and reduced complexity, coinciding with archaic, classical, and late distinctions in Tocharian B manuscripts, and exclusively for all known Tocharian A texts.[20] Regional variants of the script reflected geographic and cultural differences across the Tarim Basin. Western styles, centered in Kucha, appeared in the 5th–6th centuries CE as Early Tarim Brahmi A, linked to northern Silk Road transmission and used for Tocharian B in administrative and religious contexts.[19] Eastern variants in the Turfan region, known as Tarim Brahmi North 1 and 2, developed by the 6th–7th centuries CE and served multiple languages including Tocharian A and B, Sanskrit, and Tumshuqese, with evidence from over 4,000 manuscripts showing consistent but locally adapted letter proportions.[19] Central areas like Šorčuq exhibited intermediate forms blending western and eastern traits, as seen in dialectal manuscript distributions.[20] The script incorporated influences and adaptations to suit Tocharian's phonological needs and local materials. Diacritics such as the Fremdvokal for the central vowel /ä/ and Fremdzeichen (11 consonant-vowel signs) were added to represent non-native Indo-European sounds absent in standard Brahmi, enabling precise notation of Tocharian's unique features like stacked aksaras and bar virama.[17] Adaptations for writing media included cursive ligatures and abbreviations in paper-based manuscripts, which predominated from the 6th century CE onward, alongside earlier uses on wooden tablets; these changes facilitated denser text in Buddhist and secular documents copied on imported Chinese paper.[20] The Tocharian script's decline began in the late 8th century CE, accelerating after 840–866 CE with Uyghur migration into the Tarim Basin, where Turkic dominance suppressed Tocharian Buddhist institutions and promoted bilingualism.[6] By the 10th–11th centuries, the script was largely replaced by the Old Uyghur script (derived from Sogdian), as Turkic languages like Uyghur became administrative and religious standards under Karakhanid rule, leading to the script's extinction by the 13th–14th centuries CE amid Islamization and cultural shifts.[6]Description
Character set and alphabet
The Tocharian script is an abugida derived from the Late Brahmi script used in the Kushan Empire, featuring approximately 33–44 consonant akṣaras (syllabic units, including variants and Fremdzeichen) with an inherent vowel /a/ and 8–13 vowel signs (independent and dependent forms). This character set was adapted around the 5th to 8th centuries CE to represent the phonology of the Tocharian languages, including a core of 24 consonants in the standard Indic varga (class) order—ka to ma, ya to ha—supplemented by aspirated stops (e.g., kha, gha) borrowed from Indo-Aryan traditions and regional variants for sibilants and semi-vowels. Despite the script's retention of voiced and aspirated letters from Brahmi, Tocharian phonology merged them with voiceless unaspirated stops, using the extra letters for etymological or loanword distinctions.[17][21] The script accommodates Tocharian's seven simple vowels (i, u, e, o, a, ä, ā) and three diphthongs (ai, au, oi) primarily in Tocharian B, with some in A from loans or archaic forms; there is no phonemic vowel length distinction in the language itself, though the script retains long vowel markers from its Brahmi origins. Archaic forms, closer to Gupta-period Brahmi, appear in early 5th–6th century inscriptions from Kucha and Turfan, while standard, more cursive variants dominate 7th–8th century manuscript evidence from the Tarim Basin.[22]Vowels
The vowel system includes independent letters for word-initial positions and dependent diacritics for following consonants. Tocharian phonology features short vowels only, with <ā> representing a low central [a] rather than a long vowel; diphthongs are limited primarily to Tocharian B. Unique to Tocharian are the central vowel <ä> [ɨ] (high) and adaptations for non-Indic sounds, often marked with a special "Fremdvokal" sign (AE) in archaic texts.[23]| Independent Form (Romanized) | Dependent Form | IPA Value | Notes |
|---|---|---|---|
| A | -a | [ə] or [ʌ] (central unrounded) | Inherent vowel; standard short a. Archaic open form in early manuscripts. |
| Ā | -ā | [a] (low central) | Transcription convention for short low vowel; no true length. |
| I, Ī | -i, -ī | [i] (high front) | Short i; long marker unused phonemically. |
| U, Ū | -u, -ū | [u] (high back) | Short u; rounded back vowel. |
| E | -e | [e] (mid front) | Unrounded mid front; common in verbal endings. |
| O | -o | [o] (mid back) | Rounded mid back. |
| Ä (AE, Fremdvokal) | -ä | [ɛ] or [ə] (central) | Unique to Tocharian; often superscript dot or special stroke in standard variant. |
| AI | -ai | [ai̯] (diphthong) | Low central with front off-glide; primarily Tocharian B, some in A. |
| AU | -au | [au̯] (diphthong) | Low central with back off-glide; primarily Tocharian B, some in A. |
| OI | -oi | [oi̯] (diphthong) | Mid back with front off-glide; rare, Tocharian B. |
Consonants
Consonants are organized into five varga groups (gutturals, palatals, cerebrals, dentals, labials), plus semi-vowels, sibilants, and aspirates. Each carries an inherent /a/, removed via virāma (a horizontal stroke below) for clusters. Phonetic values align with voiceless stops and affricates ([p, t, k, ts, t͡ʃ]); no phonemic voicing or aspiration distinction exists, so letters for voiced and aspirated consonants represent voiceless unaspirated sounds, with aspirates sometimes denoting fricatives in positions; sibilants distinguish dental [s], retroflex [ʂ], and palatal [ɕ]. (Script letters for voiced stops represent voiceless phonemes; no phonemic voicing distinction.) Subscript (subjoined) forms handle clusters, e.g., -r (ra-phalā, a small r below), -y (ya-phalā, hooked y below), common for prenasalized or liquid sequences. Aspirated stops (kh, gh, etc.) derive from Indo-Aryan but often denote fricatives [x, ɣ] or breathy voice in Tocharian. Archaic variants feature angular strokes, while standard forms are rounded and cursive. Not all Brahmi letters are used; e.g., voiced aspirates (jh, bh) are rare. Retroflex letters (ṭ, ḍ, etc.) are retained for loanwords but represent dental/alveolar sounds in native words.[23][22][17][21]| Varga/Group | Romanized (with inherent a) | IPA Value | Notes |
|---|---|---|---|
| Gutturals | ka | [k] | Voiceless velar stop. |
| kha | [k] or [x] | Aspirate letter; represents voiceless stop or fricative in intervocalic positions. Archaic hooked form. | |
| ga | [k] | Voiced letter represents voiceless velar stop; from palatalized k in some dialects. | |
| gha | [k] or [ɣ] | Aspirate voiced letter; represents voiceless or fricative; Indo-Aryan influence; rare. | |
| ṅa (ṅ) | [ŋ] | Velar nasal; subjoined in clusters. | |
| Palatals | ca | [t͡ʃ] | Voiceless palatal affricate. |
| cha | [t͡ʃ] | Aspirate letter; used for [t͡ʃ] or [ɕ] sibilant. | |
| ja | [t͡ʃ] | Voiced letter represents voiceless affricate; rare. | |
| ña (ñ) | [ɲ] | Palatal nasal. | |
| Cerebrals | ṭa | [t] or [ts] | Retroflex letter; represents dental/alveolar voiceless stop or affricate in native words; for [ʈ] in loans. |
| ṭha | [t] | Aspirate retroflex letter; rare. | |
| ḍa | [t] | Voiced retroflex letter represents voiceless; uncommon. | |
| ṇa (ṇ) | [n] | Retroflex nasal letter; represents dental. | |
| ṣa (ṣ) | [ʂ] | Retroflex sibilant. | |
| Dentals | ta | [t] | Voiceless dental stop. |
| tha | [t] | Aspirate letter represents voiceless dental. | |
| da | [t] | Voiced letter represents voiceless dental stop. | |
| na (n) | [n] | Dental nasal. | |
| la (l) | [l] | Lateral approximant. | |
| sa (s) | [s] | Dental sibilant. | |
| Labials | pa | [p] | Voiceless bilabial stop. |
| pha | [p] | Aspirate letter represents voiceless bilabial. | |
| ba | [p] | Voiced letter represents voiceless bilabial stop. | |
| ma (m) | [m] | Bilabial nasal. | |
| Semi-vowels & Others | ya (y) | [j] | Palatal glide; subscript for palatalization. |
| ra (r) | [r] | Alveolar trill; subscript ra-phalā common. | |
| va (v) | [w] or [β] | Labial glide; varies by position. | |
| śa (ś) | [ɕ] | Palatal sibilant. | |
| ha (h) | [h] | Glottal fricative. |