Hubbry Logo
search
logo
2308798

Brahmic scripts

logo
Community Hub0 Subscribers
Read side by side
from Wikipedia

The Brahmic scripts, also known as Indic scripts, are a family of abugida writing systems. They are descended from the Brahmi script of ancient India and are used by various languages in several language families in South, East and Southeast Asia: Indo-Aryan, Dravidian, Tibeto-Burman, Mongolic, Austroasiatic, Austronesian, and Tai. They were also the source of the dictionary order (gojūon) of Japanese kana.[1]

History

[edit]

Brahmic scripts descended from the Brahmi script. Brahmi is clearly attested from the 3rd century BCE during the reign of Ashoka, who used the script for imperial edicts. Northern Brahmi gave rise to the Gupta script during the Gupta period, which in turn diversified into a number of cursives during the medieval period. Notable examples of such medieval scripts, developed by the 7th or 8th century, include Nagari, Siddham and Sharada.

The Siddhaṃ script was especially important in Buddhism, as many sutras were written in it. The art of Siddham calligraphy survives today in Japan. The tabular presentation and dictionary order of the modern kana system of Japanese writing is believed to be descended from the Indic scripts, most likely through the spread of Buddhism.[1]

Southern Brahmi evolved into the Kadamba, Pallava and Vatteluttu scripts, which in turn diversified into other scripts of South India and Southeast Asia. Brahmic scripts spread in a peaceful manner, Indianization, or the spread of Indian learning. The scripts spread naturally to Southeast Asia, at ports on trading routes.[2] At these trading posts, ancient inscriptions have been found in Sanskrit, using scripts that originated in India. At first, inscriptions were made in Indian languages, but later the scripts were used to write the local Southeast Asian languages. Hereafter, local varieties of the scripts were developed. By the 8th century, the scripts had diverged and separated into regional scripts.[3]

Characteristics

[edit]

Some characteristics, which are present in most but not all the scripts, are:

Comparison

[edit]

Below are comparison charts of several of the major Indic scripts, organised on the principle that glyphs in the same column all derive from the same Brahmi glyph. Accordingly:

  • The charts are not comprehensive. Glyphs may be unrepresented if they are later inventions not derived from any Brahmi character.
  • The pronunciations of glyphs in the same column may not be identical. The pronunciation row is only representative; the International Phonetic Alphabet (IPA) pronunciation is given for Sanskrit where possible, or another language if necessary.

The transliteration is indicated in ISO 15919.

Consonants

[edit]
ISO[a] ka kha ga gha ṅa ca cha ja jha ña ṭa ṭha ḍa ḍha ṇa ta tha da dha na ṉa pa pha/fa ba bha ma ya ẏa ra ṟa la ḷa ḻa va śa ṣa sa ha kṣa ṯa
Ashoka Brahmi 𑀓 𑀔 𑀕 𑀖 𑀗 𑀘 𑀙 𑀚 𑀛 𑀜 𑀝 𑀞 𑀟 𑀠 𑀡 𑀢 𑀣 𑀤 𑀥 𑀦 𑀧 𑀨 𑀩 𑀪 𑀫 𑀬 𑀭 𑀮 𑀴 𑀯 𑀰 𑀱 𑀲 𑀳  
Devanagari य़ क्ष  
Bengali-
Assamese
য় র,ৰ ক্ষ  
Sharada 𑆑 𑆒 𑆓 𑆔 𑆕 𑆖 𑆗 𑆘 𑆙 𑆚 𑆛 𑆜 𑆝 𑆞 𑆟 𑆠 𑆡 𑆢 𑆣 𑆤 𑆤𑇊 𑆥 𑆦 𑆧 𑆨 𑆩 𑆪 𑆪𑇊 𑆫 𑆫𑇊 𑆬 𑆭 𑆭𑇊 𑆮 𑆯 𑆰 𑆱 𑆲 𑆑𑇀𑆰  
Gurmukhi ਲ਼ ਸ਼  
Gujarati ક્ષ  
Odia କ୍ଷ  
Grantha 𑌕 𑌖 𑌗 𑌘 𑌙 𑌚 𑌛 𑌜 𑌝 𑌞 𑌟 𑌠 𑌡 𑌢 𑌣 𑌤 𑌥 𑌦 𑌧 𑌨 𑌪 𑌫 𑌬 𑌭 𑌮 𑌯 𑌰 𑌲 𑌳 𑌵 𑌶 𑌷 𑌸 𑌹 𑌕𑍍𑌷  
Tamil க்ஷ  
Telugu న఼ య఼ క్ష  
Kannada ನ಼ ಯ಼ ಕ್ಷ  
Malayalam ക്ഷ
Sinhala ක්‍ෂ  
Tibetan གྷ ཛྷ ཌྷ དྷ བྷ [b]  
ʼPhags-pa ꡂꡜ ꡆꡜ ꡫꡜ ꡊꡜ ꡎꡜ  
Meitei Mayek[c]  
Lepcha ᰡ᰷  
Limbu  
Tirhuta 𑒏 𑒐 𑒑 𑒒 𑒓 𑒔 𑒕 𑒖 𑒗 𑒘 𑒙 𑒚 𑒛 𑒜 𑒝 𑒞 𑒟 𑒠 𑒡 𑒢 𑒣 𑒤 𑒥 𑒦 𑒧 𑒨 𑒩 𑒪 𑒬 𑒭 𑒮 𑒯  
Kaithi 𑂍 𑂎 𑂏 𑂐 𑂑 𑂒 𑂓 𑂔 𑂕 𑂖 𑂗 𑂘 𑂙 𑂛 𑂝 𑂞 𑂟 𑂠 𑂡 𑂢 𑂣 𑂤 𑂥 𑂦 𑂧 𑂨 𑂩 𑂪 𑂫 𑂬 𑂭 𑂮 𑂯  
Newa Prachalit 𑐎 𑐏 𑐐 𑐑 𑐒 𑐔 𑐕 𑐖 𑐗 𑐘 𑐚 𑐛 𑐜 𑐝 𑐞 𑐟 𑐠 𑐡 𑐢 𑐣 𑐣𑑆 𑐥 𑐦 𑐧 𑐨 𑐩 𑐫 𑐫𑑆 𑐬 𑐬𑑆 𑐮 𑐮𑑆 𑐲𑑆 𑐰 𑐱 𑐲 𑐳 𑐴 𑐎𑑂𑐲
Sylheti Nagari  
Chakma[d] 𑄇 𑄈 𑄉 𑄊 𑄋 𑄌 𑄍 𑄎 𑄏 𑄐 𑄑 𑄒 𑄓 𑄔 𑄕 𑄖 𑄗 𑄘 𑄙 𑄚 𑄛 𑄜 𑄝 𑄞 𑄟 𑄠 𑄡 𑄢 𑄣 𑅄 𑄤 𑄥 𑄦  
Burmese က ဉ / ည  
Tai Tham ᨡ,ᨢ[e] ᨣ,ᨤ[e] ᨩ,ᨪ[e] ᨷ,ᨸ[e] ᨹ,ᨺ[e] ᨻ,ᨼ[e] ᨿ,ᩀ[e] ᩉ,ᩌ[e]  
New Tai Lue ᦅ,ᦆ[e] ᦋ,ᦌ[e] ᦡ,ᦤ[e] ᦢ,ᦥ[e],ᦔ[e] ᦕ,ᦚ[e] ᦗ,ᦝ[e] ᦍ,ᦊ[e]  
Khmer  
Thai ข,(ฃ)[f] ค,(ฅ)[f] ช,(ซ)[f] ฎ,[f](ฏ) ด,[f](ต) ป,[f](บ) ผ,(ฝ)[f] พ,(ฟ)[f] ห,ฮ[f]  
Lao [g] [g] [g] [g] [g] [g] [g] [g] [g] ດ,ຕ [g] ບ,ປ ຜ,ຝ ພ,ຟ [g] [g] [g] [g]  
Cham  
Kawi 𑼒 𑼓 𑼔 𑼕 𑼖 𑼗 𑼘 𑼙 𑼚 𑼛 𑼜 𑼝 𑼞 𑼟 𑼠 𑼡 𑼢 𑼣 𑼤 𑼥 𑼳 𑼦 𑼧 𑼨 𑼩 𑼪 𑼫 𑼬 𑼭 𑼮 𑼯 𑼰 𑼱 𑼲  
Balinese  
Javanese[h] [h] [h] [h] [h] [h] [h] [h] [h] [h] [h] [h] [h] [h]  
Sundanese [i] [j] [k]  
Lontara  
Makasar 𑻠 𑻡 𑻢 𑻩 𑻪 𑻫 𑻦 𑻧 𑻨 𑻣 𑻤 𑻥 𑻬 𑻭 𑻮 𑻯 𑻰 𑻱  
Rejang ꤿ  
Batak (Toba) /  
Baybayin                                   /            
Buhid                                              
Hanunuo                                              
Tagbanwa                                                    
ISO ka kha ga gha ṅa ca cha ja jha ña ṭa ṭha ḍa ḍha ṇa ta tha da dha na ṉa pa pha ba bha ma ya ẏa ra ṟa la ḷa ḻa va śa ṣa sa ha kṣa ṯa
Notes
  1. ^ This list tries to include characters of same origins, not same sounds. In Bengali র is pronounced as but it is originally va which is still used for wa sound in Mithilakshar and modern Assamese ৱ (wabbô) was derived from middle Assamese র (wô). Compare with জ (ja) য (ya) and য় (ẏ) which are pronounced as , and e̯ô in Bengali and , and in Assamese respectively. য is related to Devanagari य (ya) and it is still pronounced as "ya" in Mithilakshar. Since their sounds shifted, the dots were added to keep the original sounds.
  2. ^ Letter used in Balti.
  3. ^ includes supplementary consonants not in contemporary use
  4. ^ inherent vowel is ā
  5. ^ a b c d e f g h i j k l m n o p Modified forms of these letters are or were used for distinctions made in local language; these distinctions are not made for Sanskrit and Pali.
  6. ^ a b c d e f g h i Modified forms of these letters are or were used for distinctions made in Thai; these distinctions are not made for Sanskrit and Pali in the Thai script.
  7. ^ a b c d e f g h i j k l m n These letters are obsolete, but were used mainly for Sanskrit and Pali in the Lao script.
  8. ^ a b c d e f g h i j k l m n Letters used in Old Javanese. They are now obsolete, but are used for honorifics in contemporary Javanese.
  9. ^ Invented new character to represent the Arabic letter خ.
  10. ^ Letter used in Old Sundanese. It is now obsolete.
  11. ^ Invented new character. Actually to represent the Arabic letter ش, which has similar pronunciation with śa.

Vowels

[edit]

Vowels are presented in their independent form on the left of each column, and in their corresponding dependent form (vowel sign) combined with the consonant k on the right. A glyph for ka is an independent consonant letter itself without any vowel sign, where the vowel a is inherent.

ISO a ā ê ô i ī u ū e ē ai o ō au ə r̥̄[a] [a] l̥̄[a]  
a ka ā ê ô i ki ī u ku ū e ke ē ai kai o ko ō au kau ə kr̥ r̥̄ kr̥̄ kl̥ l̥̄ kl̥̄ aṁ kaṁ aḥ kaḥ k
Ashoka Brahmi 𑀅 𑀓 𑀆 𑀓𑀸         𑀇 𑀓𑀺 𑀈 𑀓𑀻 𑀉 𑀓𑀼 𑀊 𑀓𑀽     𑀏 𑀓𑁂 𑀐 𑀓𑁃     𑀑 𑀓𑁄 𑀒 𑀓𑁅     𑀋 𑀓𑀾 𑀌 𑀓𑀿 𑀍 𑀓𑁀 𑀎 𑀓𑁁 𑀅𑀁 𑀓𑀁 𑀅𑀂 𑀓𑀂 𑀓𑁆  
Devanagari का कॅ कॉ कि की कु कू कॆ के कै कॊ को कौ     कृ कॄ कॢ कॣ अं कं अः कः क्  
Bengali-
Assamese
কা অ্যা ক্যা     কি কী কু কূ     কে কৈ     কো কৌ     কৃ কৄ কৢ কৣ অং কং অঃ কঃ ক্  
Gujarati કા કૅ કૉ કિ કી કુ કૂ     કે કૈ     કો કૌ     કૃ કૄ કૢ કૣ અં કં અઃ કઃ ક્,ક્‍  
Odia କା         କି କୀ କୁ କୂ     କେ କୈ     କୋ କୌ     କୃ କୄ କୢ କୣ ଅଂ କଂ ଅଃ କଃ କ୍  
Gurmukhi ਕਾ         ਕਿ ਕੀ ਕੁ ਕੂ     ਕੇ ਕੈ     ਕੋ ਕੌ                     ਅਂ ਕਂ ਅਃ ਕਃ ਕ੍  
Meitei Mayek[b] ꯑꯥ ꯀꯥ         ꯀꯤ ꯑꫫ ꯀꫫ ꯀꯨ ꯑꫬ ꯀꫬ     ꯑꯦ ꯀꯦ ꯑꯩ ꯀꯩ     ꯑꯣ ꯀꯣ ꯑꯧ ꯀꯧ                     ꯑꯪ ꯀꯪ ꯑꫵ ꯀꫵ  
Tibetan[c] ཨཱ ཀཱ         ཨི ཀི ཨཱི ཀཱི ཨུ ཀུ ཨཱུ ཀཱུ     ཨེ ཀེ ཨཻ ཀཻ     ཨོ ཀོ ཨཽ ཀཽ     རྀ ཀྲྀ རཱྀ ཀཷ ལྀ ཀླྀ ལཱྀ ཀླཱྀ ཨཾ ཀཾ ཨཿ ཀཿ ཀ྄  
Lepcha[c] ᰣᰦ ᰀᰦ         ᰣᰧ ᰀᰧ ᰣᰧᰶ ᰀᰧᰶ ᰣᰪ ᰀᰪ ᰣᰫ ᰀᰫ     ᰣᰬ ᰀᰬ         ᰣᰨ ᰀᰨ ᰣᰩ ᰀᰩ                     ᰣᰴ ᰀᰴ        
Limbu[c] ᤀᤠ ᤁᤠ         ᤀᤡ ᤁᤡ ᤀᤡ᤺ ᤁᤡ᤺ ᤀᤢ ᤁᤢ ᤀᤢ᤺ ᤁᤢ᤺ ᤀᤧ ᤁᤧ ᤀᤣ ᤁᤣ ᤀᤤ ᤁᤤ ᤀᤨ ᤁᤨ ᤀᤥ ᤁᤥ ᤀᤦ ᤁᤦ                     ᤀᤲ ᤁᤲ     ᤁ᤻  
Tirhuta 𑒁 𑒏 𑒂 𑒏𑒰         𑒃 𑒏𑒱 𑒄 𑒏𑒲 𑒅 𑒏𑒳 𑒆 𑒏𑒴   𑒏𑒺 𑒋 𑒏𑒹 𑒌 𑒏𑒻   𑒏𑒽 𑒍 𑒏𑒼 𑒎 𑒏𑒾     𑒇 𑒏𑒵 𑒈 𑒏𑒶 𑒉 𑒏𑒷 𑒊 𑒏𑒸 𑒁𑓀 𑒏𑓀 𑒁𑓁 𑒏𑓁 𑒏𑓂  
Kaithi 𑂃 𑂍 𑂄 𑂍𑂰         𑂅 𑂍𑂱 𑂆 𑂍𑂲 𑂇 𑂍𑂳 𑂈 𑂍𑂴     𑂉 𑂍𑂵 𑂊 𑂍𑂶     𑂋 𑂍𑂷 𑂌 𑂍𑂸                     𑂃𑂁 𑂍𑂁 𑂃𑂂 𑂍𑂂 𑂍𑂹  
Newa Prachalit 𑐀 𑐎 𑐁 𑐎𑐵         𑐂 𑐎𑐶 𑐃 𑐎𑐷 𑐄 𑐎𑐸 𑐅 𑐎𑐹     𑐊 𑐎𑐾 𑐋 𑐎𑐿     𑐌 𑐎𑑀 𑐍 𑐎𑑁 𑐆 𑐎𑐺 𑐇 𑐎𑐻 𑐈 𑐎𑐼 𑐉 𑐎𑐽 𑐀𑑄 𑐎𑑄 𑐀𑑅 𑐎𑑅 𑐎𑑂  
Sylheti Nagari   ꠇꠣ         ꠇꠤ     ꠇꠥ         ꠇꠦ ꠅꠂ ꠇꠂ     ꠇꠧ                         ꠀꠋ ꠇꠋ     ꠇ꠆  
Tamil கா         கி கீ கு கூ கெ கே கை கொ கோ கௌ                     அஂ கஂ அஃ கஃ க்  
Kannada ಕಾ         ಕಿ ಕೀ ಕು ಕೂ ಕೆ ಕೇ ಕೈ ಕೊ ಕೋ ಕೌ     ಕೃ ಕೄ ಕೢ ಕೣ అం ಕಂ అః ಕಃ ಕ್  
Telugu కా         కి కీ కు కూ కె కే కై కొ కో కౌ     కృ కౄ కౢ కౣ అం కం అః కః క్  
Sinhala කා කැ කෑ කි කී කු කූ කෙ කේ කෛ කො කෝ කෞ     කෘ කෲ කෟ කෳ අං කං අඃ කඃ ක්  
Malayalam കാ         കി കീ കു കൂ കെ കേ കൈ കൊ കോ കൗ     കൃ കൄ കൢ കൣ അം കം അഃ കഃ ക് ക്
Chakma 𑄃𑄧 𑄇𑄧 𑄃 𑄇 𑄃𑄬𑄬 𑄇𑄬𑄬 𑄃𑅅 𑄇𑅅 𑄄, 𑄃𑄨 𑄇𑄨 𑄃𑄩 𑄇𑄩 𑄅, 𑄃𑄪 𑄇𑄪 𑄃𑄫 𑄇𑄫     𑄆, 𑄃𑄬 𑄇𑄬 𑄃𑄰 𑄇𑄰     𑄃𑄮 𑄇𑄮 𑄃𑄯 𑄇𑄯                     𑄃𑄧𑄁 𑄇𑄧𑄁 𑄃𑄧𑄂 𑄇𑄧𑄂 𑄇𑄴  
Burmese က အာ ကာ         ကိ ကီ ကု ကူ     ကေ အဲ ကဲ ကော     အောင် ကောင်     ကၖ ကၗ ကၘ ကၙ အံ ကံ အး ကး က်  
Tai Tham ᩋᩣ ᨠᩣ/ᨠᩤ ᩋᩯ ᨠᩯ (ᩋᩬᩴ,ᩋᩳ) (ᨠᩬᩴ,ᨠᩳ) ᨠᩥ ᨠᩦ ᨠᩩ ᨠᩪ (ᩋᩮᩡ) (ᨠᩮᩡ) ᨠᩮ ᩋᩱ ᨠᩱ (ᩋᩰᩡ) (ᨠᩰᩡ) ᩒ,ᩋᩰ ᨠᩰ,ᨠᩮᩣ ᩋᩮᩢᩣ,ᩋᩯᩣ,ᩐᩣ ᨠᩮᩢᩣ,ᨠᩮᩫᩣ,ᨠᩯᩣ     ᩁᩂ ᨠᩂ             ᩋᩴ ᨠᩴ ᩋᩡ ᨠᩡ ᨠ᩺,ᨠ᩼  
New Tai Lue[c] ᦀᦱ ᦂᦱ ᦶᦀ ᦶᦂ (ᦀᦸ) (ᦂᦸ) ᦀᦲᦰ ᦂᦲᦰ ᦀᦲ ᦂᦲ ᦀᦳ ᦂᦳ ᦀᦴ ᦂᦴ (ᦵᦀᦰ) (ᦵᦂᦰ) ᦵᦀ ᦵᦂ ᦺᦀ ᦺᦂ (ᦷᦀᦰ) (ᦷᦂᦰ) ᦷᦀ ᦷᦂ ᦀᧁ ᦂᧁ                         ᦀᦰ ᦂᦰ  
Khmer[d] អា កា         កិ កី កុ កូ     កេ កៃ     កោ កៅ     ក្ឫ ក្ឬ ក្ឭ ក្ឮ អំ កំ អះ កះ ក៑  
Thai[c] อ (อะ) ก (กะ) อา กา แอ แก (ออ) (กอ) อิ กิ อี กี อุ กุ อู กู (เอะ) (เกะ) เอ เก ไอ,ใอ ไก,ใก (โอะ) (โกะ) โอ โก เอา เกา     กฺฤ ฤๅ กฺฤๅ กฺฦ ฦๅ กฺฦๅ อํ กํ อะ (อะฮฺ) กะ (กะฮฺ) กฺ (ก/ก์)  
Lao[c] ອະ ກະ ອາ ກາ ແອ ແກ (ອອ) (ກອ) ອິ ກິ ອີ ກີ ອຸ ກຸ ອູ ກູ (ແອະ) (ແກະ) ເອ ເກ ໄອ,ໃອ ໄກ,ໃກ (ໂອະ) (ໂກະ) ໂອ ໂກ ເອົາ,ອາວ ເກົາ,ກາວ                     ອํ ກํ ອະ ກະ  
Cham ꨀꨩ ꨆꨩ         ꨆꨪ ꨁꨩ ꨆꨫ ꨆꨭ ꨂꨩ ꨆꨭꨩ     ꨆꨯꨮ ꨆꨰ     ꨆꨯ ꨀꨯꨱ ꨆꨯꨱ     ꨣꨮ ꨆꨴꨮ ꨣꨮꨩ ꨆꨴꨮꨩ ꨤꨮ ꨆꨵꨮ ꨤꨮꨩ ꨆꨵꨮꨩ ꨀꩌ ꨆꩌ ꨀꩍ ꨆꩍ  
Kawi 𑼄 𑼒 𑼅 𑼒𑼴 𑼆 𑼒𑼶 𑼇 𑼒𑼷 𑼈 𑼒𑼸 𑼉 𑼒𑼹 𑼎 𑼒𑼾 𑼏 𑼒𑼿 𑼐 𑼒𑼾𑼴 𑼐𑼴 𑼒𑼿𑼴 𑼄𑽀 𑼒𑽀 𑼊 𑼒𑼺 𑼊𑼴 𑼒𑼺𑼴 𑼌 𑼒𑽂𑼌 𑼍 𑼒𑽂𑼭𑽀𑼴 𑼄𑼁 𑼒𑼁 𑼄𑼃 𑼒𑼃 𑼒𑽁  
Balinese ᬓᬵ         ᬓᬶ ᬓᬷ ᬓᬸ ᬓᬹ ᬓᬾ     ᬓᬿ ᬓᭀ     ᬓᭁ ᬅᭂ ᬓᭂ ᬓᬺ ᬓᬻ ᬓᬼ ᬓᬽ ᬅᬂ ᬓᬂ ᬅᬄ ᬓᬄ ᬓ᭄  
Javanese ꦄꦴ ꦏꦴ         ꦏꦶ ꦏꦷ ꦏꦸ ꦈꦴ ꦏꦹ ꦏꦺ     ꦏꦻ ꦏꦺꦴ     ꦎꦴ ꦏꦻꦴ ꦄꦼ ꦏꦼ ꦏꦽ ꦉꦴ ꦏꦽꦴ ꦏ꧀ꦊ ꦏ꧀ꦋ ꦄꦁ ꦏꦁ ꦄꦃ ꦏꦃ ꦏ꧀  
Sundanese             ᮊᮤ     ᮊᮥ     ᮊᮦ         ᮊᮧ         ᮊᮨ [e] ᮊ᮪ᮻ[e]     [e] ᮊ᮪ᮼ[e]     ᮃᮀ ᮊᮀ ᮃᮂ ᮊᮂ ᮊ᮪  
Lontara             ᨕᨗ ᨀᨗ     ᨕᨘ ᨀᨘ     ᨕᨙ ᨀᨙ         ᨕᨚ ᨀᨚ         ᨕᨛ ᨀᨛ                            
Makasar 𑻱 𑻠 𑻱𑻳 𑻠𑻳 𑻱𑻴 𑻠𑻴 𑻱𑻵 𑻠𑻵 𑻱𑻶 𑻠𑻶      
Rejang     ꥆꥎ ꤰꥎ ꥆꥍ ꤰꥍ ꥆꥇ ꤰꥇ     ꥆꥈ ꤰꥈ     ꥆꥉ ꤰꥉ     ꥆꥊ ꤰꥊ ꥆꥋ ꤰꥋ     ꥆꥌ ꤰꥌ                     ꥆꥏ ꤰꥏ ꥆꥒ ꤰꥒ ꤰ꥓  
Batak (Toba)             ᯂᯪ     ᯂᯮ       ᯂᯩ           ᯂᯬ                             ᯀᯰ ᯂᯰ ᯀᯱ ᯂᯱ ᯂ᯲  
Baybayin             ᜃᜒ     ᜃᜓ     ᜃᜒ         ᜃᜓ                                     ᜃ᜔  
Buhid             ᝃᝒ     ᝃᝓ                                                            
Hanunuo             ᜣᜲ     ᜣᜳ                                                         ᜣ᜴  
Tagbanwa             ᝣᝲ     ᝣᝳ                                                            
ISO a ka ā ê ô i ki ī u ku ū e ke ē ai kai o ko ō au kau ə kr̥ r̥̄ kr̥̄ kl̥ l̥̄ kl̥̄ aṁ kaṁ aḥ kaḥ k
a ā ê ô i ī u ū e ē ai o ō au ə r̥̄ l̥̄    

Notes

  1. ^ a b c Letters for r̥̄, , l̥̄ and a few others are obsolete or very rarely used.
  2. ^ includes supplementary vowels not in contemporary use
  3. ^ a b c d e f Tibetan, Lepcha, Limbu, New Tai Lue, Thai and Lao scripts do not have independent vowel forms. For syllables starting with a vowel sound, a "zero" consonant (ཨ, อ or ອ respectively) is used to represent the glottal stop /ʔ/.
  4. ^ When used to write their own languages, Khmer can have either an a or an o as the inherent vowel, following the rules of its orthography.
  5. ^ a b c d Letters used in Old Sundanese. They are now obsolete.

Numerals

[edit]
Hindu-Arabic 0 1 2 3 4 5 6 7 8 9
Brahmi numbers 𑁒 𑁓 𑁔 𑁕 𑁖 𑁗 𑁘 𑁙 𑁚
Brahmi digits 𑁦 𑁧 𑁨 𑁩 𑁪 𑁫 𑁬 𑁭 𑁮 𑁯
Bengali- Assamese
Tirhuta 𑓐 𑓑 𑓒 𑓓 𑓔 𑓕 𑓖 𑓗 𑓘 𑓙
Odia
Devanagari
Gujarati
Modi 𑙐‎ 𑙑‎ 𑙒 𑙓‎ 𑙔‎ 𑙕 𑙖‎ 𑙗 𑙘‎ 𑙙
Sharada 𑇐 𑇑 𑇒 𑇓 𑇔 𑇕 𑇖 𑇗 𑇘 𑇙
Takri 𑛀 𑛁 𑛂 𑛃 𑛄 𑛅 𑛆 𑛇 𑛈 𑛉
Gurmukhi
Khudabadi 𑋰 𑋱 𑋲 𑋳 𑋴 𑋵 𑋶 𑋷 𑋸 𑋹
Meitei (Manipuri)
Pracalit 𑑐‎ 𑑑‎ 𑑒‎ 𑑓‎ 𑑔‎ 𑑕‎ 𑑖‎ 𑑗‎ 𑑘‎ 𑑙
Tibetan
Mongolian[a]
Lepcha
Limbu
Sinhala astrological numbers
Sinhala archaic numbers 𑇡 𑇢 𑇣 𑇤 𑇥 𑇦 𑇧 𑇨 𑇩
Tamil
Telugu
Kannada
Malayalam
Saurashtra
Ahom 𑜰 𑜱 𑜲 𑜳 𑜴 𑜵 𑜶 𑜷 𑜸 𑜹
Chakma 𑄶 𑄷 𑄸 𑄹 𑄺 𑄻 𑄼 𑄽 𑄾 𑄿
Burmese
Tai Tham Astrological Numbers[b]
New Tai Lue
Shan
Khmer
Thai
Lao
Tai Tham[c]
Cham
Balinese
Javanese
Sundanese
Hindu-Arabic 0 1 2 3 4 5 6 7 8 9

Notes

  1. ^ Mongolian numerals are derived from Tibetan numerals and used in conjunction with the Mongolian and Clear script
  2. ^ for everyday use
  3. ^ for liturgical use

List of Brahmic scripts

[edit]

Historical

[edit]

The Brahmi script was already divided into regional variants at the time of the earliest surviving epigraphy around the 3rd century BC. Cursives of the Brahmi script began to diversify further from around the 5th century AD and continued to give rise to new scripts throughout the Middle Ages. The main division in antiquity was between northern and southern Brahmi. In the northern group, the Gupta script was very influential, and in the southern group the Vatteluttu and Kadamba/Pallava scripts with the spread of Buddhism sent Brahmic scripts throughout Southeast Asia.[citation needed]

Early Brahmic scripts
IAST Ashoka Girnar Chandra­gupta Gujarat Prayagraj Narbada Kistna
a
ā
i
ī
u
ū
e
ai
o
au
k
kh
g
gh
c
ch
j
jh
ñ
ṭh
ḍh
t
th
d
dh
n
p
ph
b
bh
m
y
r
l
v
ś
s
h

Northern Brahmic

[edit]

Southern Brahmic

[edit]

Unicode of Brahmic scripts

[edit]

As of Unicode version 17.0, the following Brahmic scripts have been encoded:

script derivation Period of derivation usage notes ISO 15924 Unicode range(s) sample
Ahom Burmese[5] 13th century Extinct Ahom language Ahom U+11700–U+1174F 𑜒𑜠𑜑𑜨𑜉
Balinese Kawi 11th century Balinese language Bali U+1B00–U+1B7F ᬅᬓ᭄ᬲᬭᬩᬮᬶ
Batak Pallava 14th century Batak languages Batk U+1BC0–U+1BFF ᯘᯮᯒᯖ᯲ ᯅᯖᯂ᯲
Baybayin Kawi 14th century Tagalog, other Philippine languages Tglg U+1700–U+171F ᜊᜌ᜔ᜊᜌᜒᜈ᜔
Bengali-Assamese (Eastern Nagari) Siddhaṃ 11th century Assamese language (Assamese script variant), Bengali language (Bengali script variant), Bishnupriya, Maithili, Meitei language (constitutionally termed as "Manipuri")[6] Beng U+0980–U+09FF
  • অসমীয়া লিপি
  • বাংলা লিপি
Bhaiksuki Gupta 11th century Was used around the turn of the first millennium for writing Sanskrit Bhks U+11C00–U+11C6F 𑰥𑰹𑰎𑰿𑰬𑰲𑰎𑰱
Buhid Kawi 14th century Buhid language Buhd U+1740–U+175F ᝊᝓᝑᝒᝇ
Mon-Burmese Pallava 11th century Burmese language, Mon language, numerous modifications for other languages including Chakma, Eastern and Western Pwo Karen, Geba Karen, Kayah, Rumai Palaung, S'gaw Karen, Shan Mymr U+1000–U+109F, U+A9E0–U+A9FF, U+AA60–U+AA7F, U+116D0–U116FF မြန်မာအက္ခရာ
Chakma Burmese 8th century Chakma language Cakm U+11100–U+1114F 𑄌𑄋𑄴𑄟𑄳𑄦
Cham Pallava 8th century Cham language Cham U+AA00–U+AA5F ꨌꩌ
Devanagari Nagari 13th century Several Indo-Aryan languages (Konkani, Marathi, Hindi, Sanskrit, Nepali, Bhili, Sindhi, Gujarati etc), Sino-Tibetan languages (Bodo, Nepal Bhasa, Sherpa etc.), Mundari (Austroasiatic language) and others. Deva U+0900–U+097F, U+A8E0–U+A8FF, U+11B00–U+11B5F देवनागरी
Dhives Akuru Gupta Before 6th-8th century Was used to write the Maldivian language up until the 20th century.[7] Diak U+11900–U+1195F 𑤞𑥂𑤧𑤭𑥂
Dogra Takri Was used to write Dogri. Dogra script is closely related to Takri.[8] Dogr U+11800–U+1184F 𑠖𑠵𑠌𑠤𑠬
Grantha Pallava 6th century Restricted use in traditional Vedic schools to write Sanskrit. Was widely used by Tamil speakers for Sanskrit and the classical language Manipravalam. Gran U+11300–U+1137F 𑌗𑍍𑌰𑌨𑍍𑌥
Gujarati Nagari 17th century Gujarati language, Kutchi language Gujr U+0A80–U+0AFF ગુજરાતી લિપિ
Gunjala Gondi uncertain 16th century Used for writing the Adilabad dialect of the Gondi language.[9] Gong U+11D60–U+11DAF 𑵶𑶍𑶕𑶀𑵵𑶊 𑵶𑶓𑶕𑶂𑶋
Gurmukhi Sharada 16th century Punjabi language Guru U+0A00–U+0A7F ਗੁਰਮੁਖੀ
Hanunó'o Kawi 14th century Hanuno'o language Hano U+1720–U+173F ᜱᜨᜳᜨᜳᜢ
Javanese Kawi 16th century Javanese language, Sundanese language, Madurese language Java U+A980–U+A9DF ꦄꦏ꧀ꦱꦫꦗꦮ
Kaithi Nagari 16th century Historically used for writing legal, administrative, and private records. Kthi U+11080–U+110CF 𑂍𑂶𑂟𑂲
Kannada Telugu-Kannada Around 4th-6th century Sanskrit, Kannada, Konkani, Tulu, Badaga, Kodava, Beary, others Knda U+0C80–U+0CFF ಕನ್ನಡ ಅಕ್ಷರಮಾಲೆ
Kawi Pallava 8th century Kawi was found primarily in Java and used across much of Maritime Southeast Asia between the 8th century and the 16th century.[10] Kawi U+11F00–U+11F5F 𑼒𑼮𑼶
Khmer Pallava 11th century Khmer language Khmr U+1780–U+17FF, U+19E0–U+19FF អក្សរខ្មែរ
Khojki Landa 16th century Some use by Ismaili communities. Was used by the Khoja community for Muslim religious literature. Khoj U+11200–U+1124F 𑈉𑈲𑈐𑈈𑈮
Khudawadi Landa 16th century Was used by Sindhi communities for correspondence and business records. Sind U+112B0–U+112FF 𑊻𑋩𑋣𑋏𑋠𑋔𑋠𑋏𑋢
Lao Khmer 14th century Lao language, others Laoo U+0E80–U+0EFF ອັກສອນລາວ
Lepcha Tibetan 8th century Lepcha language Lepc U+1C00–U+1C4F ᰛᰩᰴ
Limbu Lepcha 9th century Limbu language Limb U+1900–U+194F ᤛᤡᤖᤡᤈᤨᤅ
Lontara Kawi 17th century Buginese language, others Bugi U+1A00–U+1A1F ᨒᨚᨈᨑ
Mahajani Landa 16th century Historically used in northern India for writing accounts and financial records. Mahj U+11150–U+1117F 𑅬𑅱𑅛𑅧𑅑
Makasar Kawi 17th century Was used in South Sulawesi, Indonesia for writing the Makassarese language.[11] Makasar script is also known as "Old Makassarese" or "Makassarese bird script" in English-language scholarly works.[12] Maka U+11EE0–U+11EFF 𑻪𑻢𑻪𑻢
Malayalam Grantha 12th century Malayalam Mlym U+0D00–U+0D7F മലയാളലിപി
Marchen Tibetan 7th century Was used in the Tibetan Bön tradition to write the extinct Zhang-Zhung language Marc U+11C70–U+11CBF 𑱳𑲁𑱽𑱾𑲌𑱵𑲋𑲱𑱴𑱶𑲱𑲅𑲊𑱱
Meetei Mayek Tibetan[13] 6th century[14] officially used for Meitei language (constitutionally termed as "Manipuri") in accordance to "The Manipur Official Language (Amendment) Act, 2021"[6] Mtei U+AAE0–U+AAFF, U+ABC0–U+ABFF ꯃꯤꯇꯩ ꯃꯌꯦꯛ
Modi Nāgarī 17th century Was used to write the Marathi language Modi U+11600–U+1165F 𑘦𑘻𑘚𑘲
Multani Landa Was used to write the Multani language Mult U+11280–U+112AF 𑊠𑊣𑊖𑊚
Nandinagari Nāgarī 7th century Historically used to write Sanskrit in southern India Nand U+119A0–U+119FF 𑧁𑧞𑦿𑧒𑧁𑧑𑦰𑧈𑧓
New Tai Lue Tai Tham 1950s Tai Lü language Talu U+1980–U+19DF ᦟᦲᧅᦷᦎᦺᦑ
Odia Siddhaṃ 13th century Odia language Orya U+0B00–U+0B7F ଓଡ଼ିଆ ଅକ୍ଷର
ʼPhags-pa Tibetan 13th century Historically used during the Mongol Yuan dynasty. Phag U+A840–U+A87F ꡖꡍꡂꡛ ꡌ
Prachalit (Newa) Nepal Has been used for writing the Sanskrit, Nepali, Hindi, Bengali, and Maithili languages Newa U+11400–U+1147F 𑐥𑑂𑐬𑐔𑐮𑐶𑐟
Rejang Kawi 18th century Rejang language, mostly obsolete Rjng U+A930–U+A95F ꥆꤰ꥓ꤼꤽ ꤽꥍꤺꥏ
Saurashtra Grantha 20th century Saurashtra language, mostly obsolete Saur U+A880–U+A8DF ꢱꣃꢬꢵꢰ꣄ꢜ꣄ꢬꢵ
Sharada Gupta 8th century Was used for writing Sanskrit and Kashmiri Shrd U+11180–U111DF, U+11B60–U11B7F 𑆯𑆳𑆫𑆢𑆳
Siddham Gupta 7th century Was used for writing Sanskrit Sidd U+11580–U+115FF 𑖭𑖰𑖟𑖿𑖠𑖽
Sinhala Brahmi[15] 4th century[16] Sinhala language Sinh U+0D80–U+0DFF, U+111E0–U+111FF ශුද්ධ සිංහල
Sundanese Kawi 14th century Sundanese language Sund U+1B80–U+1BBF, U+1CC0–U+1CCF ᮃᮊ᮪ᮞᮛ ᮞᮥᮔ᮪ᮓ
Sylheti Nagari Nagari 16th century Historically used for writing the Sylheti language Sylo U+A800–U+A82F ꠍꠤꠟꠐꠤ ꠘꠣꠉꠞꠤ
Tagbanwa Kawi 14th century Various languages of Palawan, nearly extinct Tagb U+1760–U+177F ᝦᝪᝨᝯ
Tai Le Mon 13th century Tai Nüa language Tale U+1950–U+197F ᥖᥭᥰᥖᥬᥳᥑᥨᥒᥰ
Tai Tham Mon 13th century Northern Thai language, Tai Lü language, Khün language Lana U+1A20–U+1AAF ᨲᩫ᩠ᩅᨾᩮᩬᩥᨦ
Tai Viet Thai 16th century Tai Dam language Tavt U+AA80–U+AADF ꪼꪕꪒꪾ
Tai Yo/Lai Tay Khmer 16th century Tai Yo language[17] Tayo U+1E6C0–U1E6FF
Takri Sharada 16th century Was used for writing Chambeali, and other languages Takr U+11680–U+116CF 𑚔𑚭𑚊𑚤𑚯
Tamil Pallava 2nd century Tamil language Taml U+0B80–U+0BFF, U+11FC0–U+11FFF தமிழ் அரிச்சுவடி
Telugu Telugu-Kannada 5th century Telugu language Telu U+0C00–U+0C7F తెలుగు లిపి
Thai Old Khmer 13th century Thai language Thai U+0E00–U+0E7F อักษรไทย
Tibetan Gupta 8th century Classical Tibetan, Dzongkha, Ladakhi language Tibt U+0F00–U+0FFF བོད་ཡིག་
Tigalari/Tulu Grantha 9th century Tulu, Kannada, and Sanskrit Tutg U+11380–U113FF 𑎡𑎻𑎳𑎻𑎭𑎹𑎦𑎹
Tirhuta Siddham 13th century Historically used for the Maithili language Tirh U+11480–U+114DF 𑒞𑒱𑒩𑒯𑒳𑒞𑒰

See also

[edit]

References

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
The Brahmic scripts, also known as Indic scripts, constitute a large family of abugida writing systems descended from the ancient Brahmi script of the Indian subcontinent, employed today to write numerous languages across South Asia, Southeast Asia, and parts of Central and East Asia.[1][2] Originating around the 3rd century BCE during the reign of Mauryan Emperor Ashoka, whose rock edicts provide the earliest substantial attestations, the Brahmi script may trace its roots to even earlier forms, such as Tamil-Brahmi inscriptions possibly dating to the 7th century BCE based on 2024 radiocarbon analysis.[1][3][4] These scripts evolved through regional variations, notably during the Gupta Empire in the 5th century CE, when Brahmi diversified into northern and southern branches, leading to medieval forms like Siddham, Sharada, and Nagari; this process continued into the modern era, adapting to phonetic needs of Indo-Aryan, Dravidian, Austroasiatic, and other language families.[1][5] Key characteristics include an inherent short vowel /a/ (or /o/ in some like Bengali and Odia) attached to each consonant, with dependent diacritics for other vowels, independent vowel signs, consonant ligatures (conjuncts) for clusters, and matras for modifications like nasalization or aspiration, all arranged horizontally from left to right without spaces between words in classical usage.[1][2] Prominent contemporary examples encompass Devanagari (used for Hindi, Marathi, Nepali, and Sanskrit), Bengali-Assamese, Gurmukhi (Punjabi), southern Dravidian scripts like Tamil, Telugu, Kannada, and Malayalam, as well as Southeast Asian derivatives such as Thai, Khmer, Javanese, and East Asian ones like Tibetan.[1][6] This script family has profoundly shaped the documentation of religious texts, literature, and administration in Buddhist, Hindu, and Jain traditions, facilitating the spread of Indian cultural influence across Asia.[1][7]

Historical Development

Origins in the Brahmi Script

The Brahmi script emerged as the foundational writing system for ancient India, with its earliest attested uses dating to the 3rd century BCE during the Mauryan Empire under Emperor Ashoka. The script is most famously documented in Ashoka's edicts, a series of inscriptions carved on rocks, pillars, and cave walls across the subcontinent between approximately 268 and 232 BCE, primarily in Prakrit to propagate Buddhist principles and imperial policies. These edicts represent the first indisputably dated and widespread application of Brahmi, demonstrating its role in official and religious communication.[8] Scholars debate the origins of Brahmi, with two primary theories dominating the discourse: indigenous development and external Semitic influence. The indigenous hypothesis suggests Brahmi evolved from earlier local traditions, potentially drawing from the undeciphered symbols of the Indus Valley Civilization (circa 2600–1900 BCE), positing a continuity in South Asian writing practices despite the gap in evidence. In contrast, the Aramaic hypothesis, first systematically proposed by Georg Bühler in the late 19th century, argues for derivation from the Aramaic script used in the Achaemenid Empire, citing graphic similarities in several letter forms and the coexistence of Aramaic and proto-Brahmi on punch-marked silver coins from the 4th century BCE, which circulated in the northwest under Mauryan control. Evidence from these coins, including bilingual legends, supports the idea of adaptation during cultural exchanges along trade routes, though the exact mechanism of transmission remains unresolved.[9][10][11] Brahmi's initial phonetic inventory comprised 33 consonants, arranged in groups based on articulation points (gutturals, palatals, cerebrals, dentals, and labials), and 10 vowels (short and long forms of a, i, u, plus e, ai, o, and au), with independent symbols for vowels and diacritics modifying consonants to form syllables. This abugida structure inherently vocalizes consonants with an a sound unless modified. Early inscriptions, such as the Tamil-Brahmi variants from South Indian sites like the Mangulam caves (dated to the late 3rd century BCE), illustrate adaptations for Dravidian languages by simplifying the consonant set—omitting voiced aspirates and other Indo-Aryan phonemes unnecessary for Tamil, such as certain fricatives—and using simplified diacritics, as seen in names like cēra (Chera) inscribed on rock surfaces.[12][13] Paleographically, Brahmi exhibits a consistent left-to-right writing direction in horizontal lines, a defining feature that influenced all descendant scripts. Letters rest on or align with a horizontal baseline, often formed by a crossbar or the lower stroke of characters, promoting uniformity in inscription layout. Vowel indications rely on early diacritical marks—subscript or superscript strokes attached to the consonant's vertical stem—such as a leftward curve for i or a downward hook for u, marking an innovative departure from purely consonantal Semitic systems toward syllabic representation. These features, evident in Ashokan pillars' polished surfaces, underscore Brahmi's adaptability for monumental epigraphy.

Divergence and Regional Evolution

Following the Mauryan Empire's decline in the 2nd century BCE, the Brahmi script underwent significant post-Mauryan developments, with the Gupta script emerging as a key transitional form between the 4th and 6th centuries CE. This script, often termed Gupta Brahmi, represented a refined evolution from earlier Brahmi variants, incorporating more fluid and curved letter forms that introduced early cursive elements, facilitating smoother writing on materials like palm leaves and birch bark.[14] These changes reflected adaptations to administrative and literary needs in the Gupta Empire, marking a shift toward greater angularity reduction and ligature usage while retaining the core abugida structure inherited from Brahmi.[15] Major divergence points in Brahmic script evolution occurred around the 7th century CE, with the Siddham script branching off as a pivotal form for northern lineages, influencing scripts like Devanagari and Tibetan. Siddham, descending directly from Gupta Brahmi, standardized rounded and ornate forms suited for Buddhist manuscripts and esoteric texts, spreading through monastic networks across northern India and Central Asia.[16] Concurrently, in the south, the Pallava script developed from the 4th to 9th centuries CE, serving as the ancestor for southern Indian and Southeast Asian branches, including Grantha and early Dravidian adaptations. This script's angular, monumental style, evident in temple inscriptions of the Pallava dynasty, facilitated its transmission to regions like Tamil Nadu and beyond.[17] The proliferation of Brahmic scripts across Asia was driven by historical events, particularly Buddhist and Hindu missionary activities from the 1st to 10th centuries CE, which carried scripts via trade routes and religious expansion into Southeast Asia and Tibet. These missions, often supported by royal patronage, embedded Brahmic forms in Sanskrit and Pali texts, leading to localized adaptations in kingdoms like Funan and Srivijaya.[18] Later, Islamic invasions from the 8th century onward disrupted script survival in northern and western India, promoting Perso-Arabic alternatives and marginalizing Brahmic usage in administrative contexts, though southern variants persisted in temple traditions.[19] Colonial influences under British rule in the 19th century further shaped standardization, as printing presses and romanization efforts imposed typographic uniformity on surviving Brahmic scripts like Devanagari, blending traditional forms with mechanical constraints.[20] Key evolutions in the timeline include the 8th to 12th centuries CE, when precursors to the Nagari script emerged in northern India through inscriptional variants that bridged Siddham and proto-Devanagari, featuring increased horizontality and vowel mark refinements for vernacular use. From the 7th century CE, the Khmer script in Cambodia derived from Pallava influences, evolving into a distinct rounded form for Old Khmer inscriptions, reflecting sustained Southeast Asian adaptation amid Angkorian cultural flourishing.[21][22]

Core Characteristics

Abugida Structure and Syllabary

Brahmic scripts are classified as abugidas, a type of writing system where consonants serve as the primary graphemes, each inherently carrying a default vowel sound, typically /a/ though /o/ or /ɔ/ in scripts such as Bengali and Odia, and other vowels are indicated through diacritical marks known as matras attached to the consonant base.[23][1] This structure organizes writing around syllabic units, with each basic syllable formed by a consonant followed by its inherent vowel unless modified.[24] For instance, a standalone consonant symbol like k represents the syllable /ka/, reflecting the implicit /a/ vowel.[25] To represent vowels other than the inherent /a/, matras are employed as dependent signs positioned above, below, to the left, or to the right of the consonant, depending on the script's conventions, thus altering the syllable's pronunciation without altering the core consonant shape.[24] When no vowel is needed after a consonant, a virama (also called halant) suppresses the inherent /a/, rendering the consonant as a pure consonantal element that can combine with following sounds.[25] A simple example is the formation of /ki/: the consonant k pairs with the matra for /i/, creating ki, while /k/ alone uses k plus virama to eliminate the /a/.[24] Consonant clusters, common in many languages using these scripts, are formed through shared structural rules involving stacking or ligation. The virama links consonants, often resulting in half-forms or subjoined shapes where the second consonant is reduced and attached below or to the side of the first, forming a single glyph for the cluster.[24] For the consonant /r/, a special form called repha—typically a stroked or curved mark—appears above the preceding consonant or the cluster to indicate its initial position.[26] An example syllable like /kta/ might combine k with virama, followed by t and inherent /a/, yielding a stacked conjunct glyph.[27] Additional elements include the anusvara, a dot-like mark denoting nasalization after a vowel or consonant, and the visarga, two small dots representing a voiceless breath (/h/) at syllable ends.[24] These enhance syllabic expression; for nasalized /kã/, the anusvara follows ka, while visarga adds /kaḥ/ for aspiration. Such conventions ensure efficient syllabary representation across Brahmic scripts, prioritizing compact, phonetic encoding.[23]

Phonetic and Visual Features

Brahmic scripts are primarily designed to represent the phonological systems of Indo-Aryan and Dravidian languages, which share a core set of phonemes while incorporating unique features such as retroflex consonants produced with the tongue curled back against the hard palate.[28] These retroflex sounds, including stops like /ʈ/, /ʈʰ/, /ɖ/, and /ɖʱ/, are characteristic of South Asian languages and are distinctly encoded in the scripts' consonant inventories, distinguishing them from many other writing systems worldwide.[29] This phonetic accommodation allows for precise articulation of sounds central to the prosody and morphology of these language families, supporting the scripts' role in preserving linguistic diversity across the Indian subcontinent.[28] Visually, Brahmic scripts display distinctive hallmarks that vary by regional lineage, with Northern forms often characterized by angular strokes and a prominent horizontal headline—a continuous top line that links characters for structural cohesion—while Southern variants favor more rounded, curving strokes that evoke a fluid, palm-leaf inscription aesthetic.[30][31] This baseline alignment, where the lower portions of characters rest on an imaginary horizontal line, ensures uniform text flow and readability in horizontal writing direction, a trait inherited from ancient Brahmi prototypes. The interplay of these strokes contributes to the scripts' compact density, enabling efficient representation of syllabic units without excessive horizontal spread. Diacritics in Brahmic scripts, used to modify inherent vowel sounds or indicate additional phonetic nuances, are positioned above, below, to the left, or even encircling the base consonants, allowing flexibility in vowel notation.[32] However, in dense textual arrangements like consonant clusters or conjunct forms, these diacritics frequently overlap with adjacent elements, creating visual complexity that can challenge legibility and automated recognition systems.[33] Such placement strategies reflect adaptations to the phonetic needs of the languages, where vowel modifications must integrate seamlessly with the core abugida structure. In Southeast Asian derivatives of Brahmic scripts, linguistic influences from tonal languages have prompted further innovations, notably the addition of diacritics for tone indication, as seen in Thai where superscript or subscript marks alter pitch on syllables.[34] These tone diacritics, typically positioned above or below the base consonant, extend the scripts' phonetic expressiveness to accommodate suprasegmental features absent in original Indo-Aryan and Dravidian contexts, enhancing their utility for languages like Thai and Khmer.[34]

Comparative Analysis

Consonant Inventories

The Brahmi script, the progenitor of the Brahmic family, features a standard inventory of 33 consonants organized into five groups of stops (vargas) based on place of articulation—gutturals (ka to ṅa), palatals (ca to ña), retroflexes (ṭa to ṇa), dentals (ta to na), and labials (pa to ma)—followed by four semivowels (ya, ra, la, va), three sibilants (śa, ṣa, sa), and the aspirate ha.[35] These consonants represent phonemes including voiceless/voiced stops (unaspirated and aspirated), nasals, approximants, fricatives, and the glottal fricative, reflecting the phonetic needs of early Indo-Aryan languages like Prakrit and Sanskrit.[35] Northern Brahmic scripts, such as Devanagari and Bengali-Assamese, largely retain this full 33-consonant inventory, often with minor additions like the palatal lateral ḷa or nukta-modified forms for Perso-Arabic sounds (e.g., ẓa, qa), resulting in 33–39 letters depending on the language. In contrast, Southern Brahmic scripts exhibit reductions to adapt to Dravidian phonologies, which lack aspiration and certain fricatives; for instance, Tamil employs only 18 consonants, omitting aspirates (e.g., no kha, gha) and merging sibilants into a single s sound while introducing unique retroflex approximants like ḻa.[36] Telugu, another Southern script, maintains closer to 36 consonants, preserving most stops but simplifying sibilants.[37] Southeast Asian derivatives like Thai expand to 44 consonants, incorporating tonal distinctions and additional letters for Mon-Khmer influences while retaining the core Brahmi order. Conjunct formations in Brahmic scripts indicate consonant clusters by suppressing the inherent vowel of the first consonant via a virama (halant) diacritic, leading to ligatures or stacked forms based on script-specific rules. In Northern scripts like Devanagari, ligation often involves halving the first consonant's glyph and fusing it to the second, as in क् + त = क्त (kta, where the vertical stroke of ka is removed and ta attaches below). Bengali follows similar rules but favors more explicit stacking for clarity, yielding ক্ + ত = ক্ত (kta).[38] Southern scripts like Tamil limit conjuncts due to simpler phonotactics, using stacked forms sparingly (e.g., க் + த = க்த for kta in loanwords) or Grantha supplements for Sanskrit terms, while Telugu employs curved ligatures like క్ + త = క్త. Thai avoids complex ligation, writing clusters sequentially with subjoined forms or diacritics, such as ก + ต = กต (kta, with no fusion).[39] These variations ensure efficient representation of syllable-onset clusters without vowels. Vowel modifications to these base consonants, such as matras, are addressed elsewhere.
Phoneme GroupDevanagari (Northern)Bengali (Northern)Tamil (Southern)Telugu (Southern)Thai (Southeast Asian)
Gutturals (ka-varga)क ख ग घ ङ (ka kha ga gha ṅa)ক খ গ ঘ ঙ (ka kha ga gha ṅa)க ங (ka ṅa; no aspirates)క ఖ గ ఘ ఙ (ka kha ga gha ṅa)ก ข ฃ ค ฅ ฆ (ko kho ngo etc.; tonal classes)
Palatals (ca-varga)च छ ज झ ञ (ca cha ja jha ña)চ ছ জ ঝ ঞ (ca cha ja jha ña)ச ஞ (ca ña; no aspirates)చ ఛ జ ఝ ఞ (ca cha ja jha ña)ฉ ช ซ ฌ (cho etc.; expanded for tones)
Example Conjunct (kta)क्तক্তக்த (rare/stacked)క్తกต (sequential)

Vowel Diacritics and Matras

In Brahmic scripts, vowels are primarily represented through diacritics known as matras, which attach to a consonant base to modify its inherent vowel sound, typically the short a (/ə/ or /a/). The core vowel inventory derives from the ancient Brahmi script, encompassing 10 to 14 phonemes organized as short/long pairs: short a (inherent), i, u, , ; long ā, ī, ū, e, ai, o, au, with occasional diphthongs or additional forms like ē in later evolutions. These matras evolved from Brahmi's simple strokes, such as horizontal bars for ā or hooks for i, and are essential for phonetic accuracy in syllabic writing. Placement of matras varies significantly across Brahmic scripts, reflecting regional adaptations for visual clarity and aesthetic integration. Pre-base matras, such as those for e and ai, attach to the left of the consonant (e.g., in Devanagari's ◌े for e), while post-base forms appear to the right (e.g., Devanagari's ◌ो for o). Supra-base diacritics are placed above the consonant, common for long vowels like ī in scripts such as Bengali (ঁ) or Thai (ิ for i), and sub-base matras hang below, as seen in Telugu's ూ for ū. These positions can stack or combine in complex conjuncts, with Southern Brahmic scripts like Tamil favoring more linear, non-curved forms to suit palm-leaf engraving, whereas Northern scripts like Gujarati employ curlier attachments. Independent vowel letters, used for words beginning with vowels or standalone vowels, mirror the matra shapes but stand alone, often elongated or stylized for prominence. In Northern Brahmic scripts like Devanagari, independent a is अ (a triangle-like form), evolving into more circular shapes in Southeast Asian derivatives like Thai's อ for a, which prioritizes rounded aesthetics influenced by Khmer. Southern scripts, such as Kannada's ಅ for a, retain linear, angular profiles from early Brahmi, facilitating compact inscription. These independent forms ensure vowels function as full akṣaras without a consonant carrier. The following table compares matra forms for key vowels in representative Brahmic scripts, highlighting positional variations and notable omissions (e.g., Southern scripts like Tamil lack distinct or matras, relying on approximations or loan adaptations).
VowelDevanagari (Northern)Bengali (Eastern)Kannada (Southern)Thai (Southeast Asian)Notes
ā (long a)◌ा (post-base vertical)◌া (post-base curve)◌ಾ (post-base stroke)◌า (post-base line)Universal post-base; inherent a omitted in Thai.
i (short i)◌ि (sub-base hook)◌ি (sub-base vertical)◌ಿ (sub-base vertical stroke)ิ (supra-base curve)Sub- to supra- shift in Thai for clarity.
ī (long i)◌ी (post-base vertical + sub-hook)◌ী (post-base + sub-vertical)◌ೀ (post-base vertical + sub-base vertical stroke)ี (supra-base double)Stacked in Northern; simplified in Southern. No ī distinction in some Dravidian contexts.
u (short u)◌ु (sub-base curve)◌ু (sub-base hook)◌ು (sub-base curve)ุ (sub-base curve)Consistent sub-base; omitted in Tamil (uses u approximation).
ū (long u)◌ू (sub-base vertical + curve)◌ূ (sub-base + hook)◌ೂ (sub-base vertical)ู (sub-base double)Vertical extension common; no ū in early Southern scripts.
e (short e)◌े (pre-base curve)◌ে (pre-base horizontal)◌ೆ (pre-base hook)เ (pre-base line)Pre-base dominant; Thai linear for print efficiency.
ai (diphthong)◌ै (pre + post curves)◌ৈ (pre + post hooks)◌ೈ (pre + post strokes)ไ (pre + supra)Combined positions; simplified in Southeast Asian forms. No native ai in some Southern inventories.
This comparative framework underscores the adaptability of matras, where Southern Brahmic scripts often omit retroflex vowels like (represented as ri in Tamil via consonant-vowel combos), prioritizing phonemic mergers for local languages.

Numeral Systems

The Brahmi numeral system emerged around the 3rd century BCE in the Indian subcontinent as part of the broader Brahmi script, utilizing an additive and multiplicative structure without inherent place-value notation. Symbols for numbers 1 through 3 were simple horizontal strokes, representing one, two, or three lines respectively, while 4 took a cross-like form reminiscent of directional lines, 5 a curved U-shape, 6 an angular chevron, 7 a hooked stroke, 8 a circle with an internal line, and 9 a full circle. This system required separate glyphs for tens (10–90), hundreds (100–900), and higher powers, combined additively to form larger numbers, reflecting an early decimal base but limited by the absence of zero as a placeholder.[40] During the Gupta period (c. 320–550 CE), Brahmi numerals underwent significant evolution, transitioning toward a true place-value system that positioned digits to denote powers of ten, with the introduction of a dot (bindu) as the precursor to the modern zero symbol around the 5th century CE. This innovation, evident in inscriptions and mathematical texts, facilitated more efficient arithmetic and laid the foundation for the global Hindu-Arabic system, as Gupta-era forms spread across the expanding empire.[40] As Brahmic scripts diverged regionally, so did their numeral glyphs, adapting to local aesthetic and calligraphic traditions while preserving the decimal structure. Northern Brahmic scripts, such as Devanagari and its relatives, developed more angular, rod-like forms with straight lines and sharp angles, emphasizing verticality and linearity. In contrast, Southern Brahmic scripts like Tamil evolved rounded, curvaceous shapes influenced by palm-leaf writing, which favored fluid strokes to prevent tearing the medium. Southeast Asian adaptations, particularly in Thai derived from Khmer (itself a Brahmic descendant via Pallava influences), produced numerals with hybrid features—often arabic-like in overall silhouette but retaining curved loops and hooks traceable to Indian prototypes.[40] The widespread adoption of the Indo-Arabic numeral system from the medieval period onward, facilitated by Islamic scholarship and European trade, largely supplanted traditional Brahmic forms in commercial and scientific contexts across South and Southeast Asia. However, Brahmic numerals endure in cultural and ritualistic roles, such as denoting dates in Hindu calendars (e.g., Vikram Samvat), religious manuscripts, and decorative motifs in architecture and typography for aesthetic continuity.[41] To illustrate regional variations, the following table compares glyphs for 0–9 in selected Brahmic scripts, highlighting traits like the dot-derived zero (often a small circle or point) and divergent curvatures. Northern forms (Devanagari, Bengali) tend toward geometric precision, Southern (Tamil) emphasize softness, and Southeast Asian (Thai) blend familiarity with ornate flourishes.
NumberDevanagariBengaliTamilThai
0
1
2
3
4
5
6
7
8
9
These glyphs are standardized in Unicode encodings, with zero's evolution from the Gupta dot underscoring its role in enabling positional notation across variants.

Classification of Scripts

Historical and Extinct Forms

The Brahmic scripts encompass a diverse array of historical writing systems derived from the ancient Brahmi script, many of which flourished across ancient India, Central Asia, and beyond before falling into disuse. These extinct forms provide critical insights into the cultural, linguistic, and political dynamics of their eras, often disappearing due to imperial shifts, the spread of new scripts like Arabic following Islamic expansions, or gradual assimilation into evolving regional variants. While some left behind significant epigraphic records, such as inscriptions on coins, stupas, and manuscripts, their extinction highlights the transient nature of writing systems amid language changes and colonization.[42][43][44] Key extinct Brahmic scripts include several prominent examples, each tied to specific regions and timelines. The Kharoshthi script, developed in the Gandhara region of present-day northern Pakistan and Afghanistan, emerged around the 3rd century BCE under Achaemenid and Indo-Greek influences and was used for Prakrit dialects until the 3rd century CE. It became extinct following the decline of the Kushan Empire, supplanted by Gupta-derived scripts amid political fragmentation and the rise of new administrative systems.[42][45] In South India, the Bhattiprolu script represents an early southern variant of Brahmi, attested from the 3rd to 1st century BCE through inscriptions on relic caskets unearthed in Andhra Pradesh. This script, characterized by rounded forms adapted to local stone engraving, faded with the consolidation of Telugu-Kannada scripts, likely due to linguistic standardization in emerging Dravidian kingdoms.[46][47] Further afield in Central Asia, the Tocharian script, a northeastern Brahmi derivative, was employed from the 5th to 8th century CE in the Tarim Basin for Indo-European Tocharian languages, appearing in Buddhist manuscripts along the Silk Road. Its extinction coincided with the Uyghur adoption of the Old Uyghur script and the broader decline of Tocharian speakers under Turkic migrations and Islamic influences.[43][48] The Gupta script, prevalent in northern India from the 4th to 6th century CE during the Gupta Empire's golden age, served primarily for Sanskrit in royal inscriptions and coins. Though it influenced later northern scripts like Nagari, the original Gupta form waned with the empire's fragmentation and the emergence of regional adaptations amid post-Gupta political instability.[49][50] Siddham, originating in northern India around the 6th century CE, was a refined Brahmi descendant used for esoteric Buddhist texts and mantras, spreading to East Asia via Silk Road transmission. By the 13th century CE, it had become extinct in India due to the decline of Buddhism under Delhi Sultanate pressures and the dominance of Perso-Arabic scripts, though it persisted longer in Japan for ritual purposes.[51][52] In the northwest, the Sharada script evolved from Gupta Brahmi by the 8th century CE in Kashmir and surrounding regions for Sanskrit and Kashmiri, featuring in temple inscriptions until its sharp decline in the 20th century. Its near-extinction resulted from colonial linguistic policies favoring Perso-Arabic and Devanagari, coupled with demographic shifts and the 1947 partition, leaving only ceremonial uses in Sikh traditions.[53][54] Eastern India's Bhaiksuki script, a Brahmi offshoot from the 11th century CE, was confined to Bihar and Bengal for Mahayana Buddhist Sanskrit texts, known from palm-leaf manuscripts. It vanished by the 12th century CE with the Pala Empire's fall to Muslim invasions, as Arabic-influenced scripts gained prominence in scholarly and administrative contexts.[55][56] The Kutila script, an 9th-10th century CE transitional form in northern and eastern India, bridged Gupta and proto-Nagari styles, appearing in inscriptions like the Bareilly grant for Prakrit and Sanskrit. Its obsolescence stemmed from the standardization of Devanagari under medieval Hindu kingdoms, absorbing its features into more uniform systems.[57][58] Historical Nagari, emerging around the 7th century CE in northwestern India as a Siddham derivative, was used for Sanskrit literature and evolved into modern Devanagari by the 10th century. The early form became extinct through gradual evolution and the spread of Perso-Arabic scripts post-12th century invasions, though its legacy endures in contemporary usage.[59][60] These scripts' legacies are preserved in archaeological finds, such as Kharoshthi's coin legends from Taxila and Tocharian's Kucha manuscripts, underscoring their role in disseminating Buddhist and Indic knowledge before geopolitical upheavals led to their demise.[42][43]

Northern Brahmic Scripts

Northern Brahmic scripts form a subgroup of the Brahmic family, primarily adapted for Indo-Aryan languages spoken in northern India, Nepal, and adjacent regions. These scripts evolved from the Gupta script around the 4th to 7th centuries CE, retaining a predominantly linear arrangement of characters with a distinctive horizontal headline or top bar that connects consonants and supports vowel diacritics. This structural feature facilitates efficient writing on palm leaves and paper, promoting a horizontal flow from left to right. Unlike southern variants, northern scripts emphasize straight lines and angular forms, reflecting regional scribal traditions in the Indo-Gangetic plains.[61] The major scripts in this group include Devanagari, Bengali-Assamese, Gurmukhi, Gujarati, and Odia, each showing distinct regional adaptations while sharing core abugida principles. Devanagari features a prominent horizontal top bar (shirorekha) that aligns characters uniformly, as seen in its use for classical Sanskrit texts. Bengali-Assamese adopts a more cursive style with fluid, rounded connections between letters, aiding rapid handwriting in eastern regions. Gurmukhi incorporates angled strokes and lacks a full top bar, creating a block-like appearance suited to Punjabi's phonetic needs. Gujarati simplifies forms by omitting the top bar entirely in modern usage, resulting in compact, cursive glyphs. Odia maintains rounded elements with a partial headline, blending northern linearity with subtle curves. These adaptations arose from local materials and linguistic shifts between the 10th and 15th centuries.[61][24] These scripts serve as official writing systems for numerous Indo-Aryan languages across India and Nepal, supporting over 800 million speakers collectively as of 2025. Devanagari is the official script for Hindi (approximately 615 million total speakers) and Sanskrit in India, as well as Nepali (around 28 million total speakers) in Nepal. Bengali-Assamese is used for Bengali (approximately 272 million total speakers) and Assamese (about 15 million native speakers), holding official status in India's West Bengal and Assam states. Gurmukhi is the primary script for Punjabi (roughly 125 million total speakers), recognized officially in India's Punjab. Gujarati supports the Gujarati language (about 60 million native speakers) as an official script in Gujarat state. Odia is the official script for the Odia language (around 50 million total speakers) in Odisha state. Their widespread usage underscores their role in education, administration, and literature, with Devanagari alone appearing in over 120 languages.[62][63][64][65][66][67] A key unique trait of northern Brahmic scripts is their full retention of aspirated consonants, a phonological feature inherited from Sanskrit and preserved in the orthography. Each consonant series includes voiced, voiceless, aspirated voiced, and aspirated voiceless stops (e.g., k, kh, g, gh), represented by distinct glyphs without merger, unlike some southern adaptations that simplify aspiration. This allows precise notation of Indo-Aryan phonetics, such as the breathy h-sound in words like "khana" (food). The scripts typically feature 33-47 consonants and 10-18 vowels, with matras (diacritics) modifying inherent vowels.[1] The following table lists five major northern Brahmic scripts, including example words for "hello" or greeting, rendered in Unicode for visual representation:
ScriptLanguages Primarily UsedExample Word (Greeting)Script Sample
DevanagariHindi, Sanskrit, Nepaliनमस्ते (namaste)नमस्ते
Bengali-AssameseBengali, Assameseনমস্কার (nomoskar)নমস্কার
GurmukhiPunjabiਸਤਿ ਸ੍ਰੀ ਅਕਾਲ (sat sri akal)ਸਤਿ ਸ੍ਰੀ ਅਕਾਲ
GujaratiGujaratiકેમ છો (kem cho)કેમ છો
OdiaOdiaନମସ୍କାର (namaskara)ନମସ୍କାର
These examples highlight script-specific forms: Devanagari's aligned top bar, Bengali's cursive loops, Gurmukhi's angular baseline, Gujarati's simplified curves, and Odia's rounded contours.[24]

Southern Brahmic Scripts

The Southern Brahmic scripts represent a distinct branch of the Brahmic family, primarily adapted for the Dravidian languages spoken in southern India. These scripts evolved from early forms of Brahmi, such as Tamil-Brahmi and Pallava Grantha, to accommodate the phonological characteristics of Dravidian tongues, which lack the aspirated consonants common in Indo-Aryan languages. Unlike their northern counterparts, southern scripts feature more rounded and compact letterforms, often with circular loops and curves that facilitate writing on palm leaves using styluses, a traditional medium in the region. This design emphasis on curvature and simplicity reflects both aesthetic preferences and practical adaptations for the languages' agglutinative structure and retroflex sounds.[68] Key adaptations include reductions in the consonant inventory to align with Dravidian phonology, such as the omission of aspirates and fricatives absent in native words. For instance, the Tamil script limits consonants to 18 basic forms, focusing on alveolar and retroflex series that are phonemically prominent in Dravidian. Telugu and Kannada scripts incorporate extensive circular loops in graphemes like ఱ (ṛa in Telugu) and ಱ (in Kannada), enhancing visual flow and distinctiveness. Vowel representations are streamlined, with matras (diacritics) positioned around consonants in ways that support Dravidian vowel harmony patterns, where front and back vowels influence each other within words, reducing the need for complex diphthong notations. Grantha, used alongside these for Sanskrit texts, retains a fuller set of sounds but influences southern script evolution by introducing ligatures for clusters. These modifications prioritize efficiency for Dravidian syllable structures, which favor open syllables and avoid initial consonant clusters.[69][70][71] These scripts serve as official writing systems for India's southern states and are used by over 250 million speakers of major Dravidian languages as of 2025, underscoring their cultural and administrative significance. They underpin rich literary traditions, notably Tamil's ancient Sangam literature, a corpus of poetry from the 3rd century BCE to 3rd century CE that documents early Tamil society, ethics, and ecology through akam (interior/love) and puram (exterior/heroic) themes. This body of work, compiled in anthologies like the Ettuttokai, exemplifies the scripts' role in preserving oral traditions in written form, influencing modern Dravidian literature and identity. The scripts' enduring use in education, media, and governance highlights their adaptability amid language reforms, such as Malayalam's 1971 simplification to reduce conjunct forms.[72][73] The following outlines five primary Southern Brahmic scripts, including their geographic distribution, unique graphemes, and a sample text (the phrase "Southern script" transliterated for clarity):
  • Tamil Script: Predominantly used in Tamil Nadu and parts of Sri Lanka, with about 86 million total speakers as of 2025. Unique graphemes include simplified consonants like ழ (ḻa, retroflex lateral) and ற (ṟa, alveolar trill), omitting aspirates. Sample: தெற்கு எழுத்து (Teṟku eḻuttu, "Southern script").[74][69][67]
  • Telugu Script: Official in Andhra Pradesh and Telangana, spoken by approximately 95 million total speakers as of 2025. Features circular loops in letters like గ (ga) and ళ (ḷa), with stacked matras for vowels. Sample: దక్షిణ లిపి (Dakṣiṇa lipi, "Southern script").[74][70][67]
  • Kannada Script: Employed in Karnataka, with around 56 million total speakers as of 2025. Characterized by rounded, spiral-derived forms such as ಕ (ka) and ಳ (ḷa), emphasizing cursive connectivity. Sample: ದಕ್ಷಿಣ ಲಿಪಿ (Dakṣiṇa lipi, "Southern script").[74][75][67]
  • Malayalam Script: Used in Kerala, spoken by about 38 million total speakers as of 2025. Includes reformed graphemes like ഴ (ḻa) and stacked forms for clusters, with a high degree of vowel diacritic integration. Sample: ദക്ഷിണ ലിപി (Dakṣiṇa lipi, "Southern script").[74][76][77]
  • Grantha Script: Historically employed in Tamil Nadu and Kerala for Sanskrit and religious texts, influencing modern southern scripts; current usage limited to religious contexts with no large speaker base. Unique for retaining Indo-Aryan sounds with graphemes like ஜ (ja, voiced palatal) and extensive ligatures. Sample: 𑌦𑌅𑍍𑌷𑌿𑌣 𑌲𑍍𑌇𑌨𑌿 (Dakṣiṇa lipi, "Southern script").[78]

Southeast Asian Brahmic Scripts

Southeast Asian Brahmic scripts emerged as a distinct regional adaptation of the ancient Brahmi script through the cultural diffusion of Indian writing systems via maritime trade routes and the influence of Hindu-Buddhist kingdoms, particularly from the 7th to 9th centuries CE. Derived mainly from the Pallava script of southern India or the related Old Mon script, these abugidas were modified to represent the phonetics of Austroasiatic languages like Khmer and Mon, as well as Tai-Kadai and Austronesian languages. This evolution addressed local phonological needs, such as complex consonant clusters and tonal distinctions absent in original Indic forms, resulting in scripts that prioritize visual complexity and syllabic efficiency.[79][80] The historical spread of these scripts was facilitated by the Khmer Empire (9th–15th centuries CE), which propagated the Khmer script across mainland Southeast Asia, influencing neighboring systems through political dominance and religious dissemination. Indian traders and monks introduced early forms as early as the 5th century CE, but widespread adoption occurred with the rise of local empires. Today, these scripts maintain official or co-official status in several nations, underscoring their role in literacy, administration, and cultural preservation amid modernization pressures. For example, the Thai script serves as the official writing system in Thailand, while the Khmer script is central to Cambodian identity.[81][82] Key adaptations include the addition of diacritics for tone indication in tonal languages, a feature not prominent in subcontinental Brahmic scripts. The Thai script employs five tone marks—realized through combinations of four diacritics and syllable structure—to denote its five tones, enhancing the abugida's capacity for Tai languages. Similarly, the Lao script incorporates tone marks derived from Thai influences. In contrast, the Khmer script modified the abugida for Mon-Khmer phonology by developing stacked (subjoined) consonants, allowing up to three consonants per syllable without additional vowels, which creates intricate vertical ligatures unique to the region. These changes reflect a shift toward accommodating register contrasts and implosive sounds in Austroasiatic tongues.[83][81] The following table outlines six prominent Southeast Asian Brahmic scripts, their derivation paths, unique features, and approximate user numbers based on primary language speakers employing the script in daily or official contexts as of 2025:
ScriptDerivation PathUnique FeaturesUsage Stats (approx. users)
KhmerFrom Pallava (7th century CE); earliest inscriptions ca. 611 CEStacked consonants (up to 3 per syllable); no inherent vowel after final consonant; 33 consonants, 23 vowels16 million (official in Cambodia; primary for Khmer language)[80][81]
ThaiFrom Old Khmer (13th century CE); standardized under King RamkhamhaengFour tone diacritics for 5 tones; rounded letterforms; 44 consonants, 15 vowel symbols71 million (official in Thailand; used for Thai and minority languages)[83][84]
LaoFrom Thai/Khmer (16th century CE); influenced by MonSimplified Thai forms with 4 tone marks; 27 consonants, 28 vowels; cursive style7.5 million (official in Laos; for Lao language and scriptures)[85][81]
JavaneseFrom Kawi (ancient Javanese, 8th–9th centuries CE), via Pallava/BrahmiMurda (special sandhi forms for loanwords); 20 consonants, 5 vowels; used for poetry (tembang)<1 million active (ceremonial/literary in Indonesia; Javanese language has 84 million speakers, mostly Latin script)[86][87]
BalineseFrom Kawi (9th century CE onward), closely related to JavaneseAksara swara (independent vowels); ornate ligatures; 18 consonants, 4 vowel signs~200,000 active (used in Bali for rituals and signs; Balinese language ~4 million speakers)[86][88]
BurmeseFrom Mon (11th century CE), via Pallava/Old MonCircular letterforms; 33 consonants, 12 vowels; no spaces between words33 million (official in Myanmar; for Burmese language)[89]
These scripts continue to evolve digitally, with Unicode support enabling their use in contemporary media, though challenges like script complexity persist in education and computing.

East Asian Brahmic Scripts

East Asian Brahmic scripts represent adaptations of the Brahmi family transmitted via Buddhist missions and trade routes from the 7th century CE onward, primarily for Tibetic languages in the Himalayan region and surrounding areas. Unlike Southeast Asian variants, these scripts incorporate stacked consonants and specific diacritics to handle Tibetan phonology, including tones and retroflex sounds, while maintaining the abugida structure with an inherent vowel. Tibetan script, the primary example, evolved from the Gupta-derived script under the Tibetan Empire, featuring a horizontal baseline with subjoined letters below for clusters, facilitating compact writing on paper and woodblocks. This branch emphasizes uniformity for religious texts, with limited regional variations compared to South Asian forms.[24] The Tibetan script serves as the official writing system for Tibetan (approximately 6 million native speakers as of 2025) in Tibet Autonomous Region, Bhutan, Nepal, and India, as well as for related languages like Ladakhi and Sherpa. It underpins the preservation of Buddhist canon (Kangyur and Tengyur) and secular literature, with over 1 million users in digital contexts. Note that the traditional Mongolian script (Hudum) is not Brahmic, deriving instead from the Uyghur script (Aramaic lineage), despite occasional historical associations with Indic influences.[90]

Modern Representation and Usage

Unicode Encoding

Brahmic scripts are represented in the Unicode Standard through dedicated code blocks allocated primarily in the Basic Multilingual Plane (BMP) for widely used modern scripts and in the Supplementary Multilingual Plane (SMP) for historical or less common variants. Major contemporary Brahmic scripts occupy sequential 128-code-point blocks starting from U+0900, reflecting their shared abugida structure derived from ancient Brahmi. For instance, Devanagari is encoded in U+0900–U+097F, Bengali in U+0980–U+09FF, Gurmukhi in U+0A00–U+0A7F, Gujarati in U+0A80–U+0AFF, Odia in U+0B00–U+0B7F, Tamil in U+0B80–U+0BFF, Telugu in U+0C00–U+0C7F, Kannada in U+0C80–U+0CFF, Malayalam in U+0D00–U+0D7F, and Sinhala in U+0D80–U+0DFF. Southeast Asian derivatives include Thai in U+0E00–U+0E7F and Lao in U+0E80–U+0EFF, while Tibetan occupies U+0F00–U+0FFF due to its extended repertoire.[91]
ScriptUnicode BlockRange
DevanagariDevanagariU+0900–U+097F
BengaliBengaliU+0980–U+09FF
GurmukhiGurmukhiU+0A00–U+0A7F
GujaratiGujaratiU+0A80–U+0AFF
OdiaOdiaU+0B00–U+0B7F
TamilTamilU+0B80–U+0BFF
TeluguTeluguU+0C00–U+0C7F
KannadaKannadaU+0C80–U+0CFF
MalayalamMalayalamU+0D00–U+0D7F
SinhalaSinhalaU+0D80–U+0DFF
ThaiThaiU+0E00–U+0E7F
LaoLaoU+0E80–U+0EFF
TibetanTibetanU+0F00–U+0FFF
Historical and extinct forms, such as ancient Brahmi itself, are encoded in the SMP at U+11000–U+1107F, added in Unicode 6.0 to support epigraphic studies. Recent additions include Marchen (U+11C70–U+11CBF), a Bön liturgical script incorporated in Unicode 9.0 (2016), and Nandinagari (U+119A0–U+119FF), a southern Indian manuscript script added in Unicode 12.0 (2019).[92][93] Encoding for Brahmic scripts follows the Unicode virama model, where text is stored in logical order—consonants preceding post-base vowel signs and matras—while rendering engines reorder elements visually to match orthographic conventions, such as stacking diacritics above or below the base consonant. The virama (e.g., U+094D DEVANAGARI SIGN VIRAMA) suppresses the inherent vowel of a consonant and enables conjunct formation when followed by another consonant, as in the sequence <U+0915 DEVANAGARI LETTER KA, U+094D DEVANAGARI SIGN VIRAMA, U+0930 DEVANAGARI LETTER RA> for the conjunct क्‍र (kra). This model ensures phonetic storage but requires sophisticated shaping for visual display.[94][95] The zero-width joiner (ZWJ, U+200D) is used in Brahmic scripts to control variant conjunct forms, particularly where multiple ligature styles exist; for example, in Devanagari, inserting ZWJ after a virama can force a half-form or explicit two-part conjunct instead of a full ligature, as in <U+0915, U+094D, U+200D, U+091C DEVANAGARI LETTER JA> to render क्‍ज without fusion. This allows precise representation of orthographic preferences across regional variations.[96][97] Early Unicode versions (1.0–3.0, 1991–2000) provided basic coverage for major Brahmic scripts like Devanagari, Bengali, and Tamil but had gaps in vowel signs, conjunct glyphs, and regional variants, limiting full digital support for complex texts until expansions in Unicode 4.0 (2003) and later. Ongoing efforts address endangered scripts through proposals; for instance, in 2024–2025, documents have been submitted for Sirmauri (a contemporary Brahmi-derived script from northern India) and Leke (an endangered script from Arunachal Pradesh), aiming for inclusion in future versions to preserve linguistic heritage.[98][99] Rendering Brahmic scripts poses challenges due to their two-dimensional syllable structure, requiring advanced layout engines to handle matra reordering, matra-conjunct interactions, and stacked diacritics; legacy systems often fail without such support, leading to garbled output. Open-source engines like HarfBuzz implement the OpenType Universal Shaping Engine for Brahmic scripts, processing sequences such as <U+092C DEVANAGARI LETTER BA, U+093E DEVANAGARI VOWEL SIGN AA> to position the matra correctly as बा, but inconsistencies arise in cross-platform rendering of ZWJ-mediated variants or rare conjuncts.[95]

Contemporary Applications and Challenges

Brahmic scripts remain integral to contemporary society in South and Southeast Asia, serving essential functions in education, media, and religious liturgy. In India, scripts like Devanagari are central to school curricula for languages such as Hindi and Sanskrit, where they facilitate literacy and cultural transmission. In media, Devanagari appears prominently in film titles, subtitles, and print publications, including Bollywood productions that reach global audiences. Similarly, the Thai script, a Southeast Asian Brahmic derivative, dominates public signage, educational materials, and Buddhist liturgical texts, reinforcing national identity and daily communication. Digital adoption has accelerated this usage, with India's digital economy valued at approximately $368 billion (₹28.94 lakh crore) in 2022–23, driven by around 660 million internet users as of early 2022 engaging with content in regional scripts.[100][101] By 2025, projections indicate continued growth, with language technologies supporting Indic scripts in apps and online platforms to bridge digital divides.[102] Despite these advancements, Brahmic scripts face significant challenges in digital and practical domains, particularly for lesser-used variants. Font availability remains limited for minor scripts like Lepcha, which is spoken by around 60,000 people in India and Nepal, hindering consistent rendering on websites and devices. Keyboard layouts pose another barrier due to the scripts' abugida structure, requiring specialized input methods that accommodate conjunct consonants and diacritics; studies highlight the need for user-friendly soft keyboards to improve typing efficiency across devices. Script reform movements address complexity issues, as seen in the 1971 Malayalam orthographic reform, which eliminated certain ligatures to simplify printing and reading, though it sparked debates on cultural authenticity and continues to influence modern typography.[103] Preservation efforts are crucial for endangered Brahmic forms, supported by international and national initiatives. UNESCO's Missing Scripts program aids in documenting and digitizing indigenous writing systems, including vulnerable Brahmic scripts like Modi, used historically for Marathi and now at risk of obsolescence. Digital archives, such as India's Gyan Bharatam Mission launched in 2025, focus on converting palm-leaf manuscripts and historical texts in various Brahmic scripts into accessible formats, ensuring long-term safeguarding against physical decay. Recent AI-driven projects, including post-2023 developments for Brahmi script recognition, employ machine learning to generate fonts and restore faded inscriptions, enhancing scholarly access and revitalization.[104][105][106] In global diaspora communities, Brahmic scripts sustain cultural ties among immigrants, often alongside Latin transliteration for accessibility. Tamil communities in Canada and the UK, for instance, use Romanized Tamil (e.g., "vanakkam" for வணக்கம்) in informal writing, social media, and community publications, balancing heritage preservation with integration into host societies. Transliteration tools like Brahmi-Net facilitate conversions between native scripts and Romanization for Indo-Aryan and Dravidian languages, aiding communication in multicultural settings. These practices highlight the scripts' adaptability, though they underscore ongoing needs for inclusive digital tools to support overseas users.[107]

References

User Avatar
No comments yet.