Ol Chiki script
| | |
|---|---|
| Script type | Alphabet |
| Creator | Raghunath Murmu |
| Period | 1925–present |
| Direction | Left-to-right |
| Languages | Santali language |
| ISO 15924 | Olck (261), Ol Chiki (Ol Cemet’, Ol, Santali) |
| Unicode alias | Ol Chiki |
| Unicode range | U+1C50–U+1C7F |
The Ol Chiki (ᱚᱞ ᱪᱤᱠᱤ, Santali pronunciation: [ɔl tʃiki], ɔl 'writing', tʃiki 'symbol') script, also known as Ol Chemetʼ (ᱚᱞ ᱪᱮᱢᱮᱫ, ol 'writing', chemetʼ 'learning'), Ol Ciki, Ol, and the Santali alphabet, is the official writing system for Santali, an Austroasiatic language recognized as an official regional language in India. It was invented by Pandit Raghunath Murmu in 1925. It has 30 letters, whose shapes are intended to evoke natural forms. The script is written from left to right and has two styles (the print Chapa style and the cursive Usara style). Unicode does not distinguish between the two, as is typical for print and cursive variants of a script. In both styles, the script is unicameral (that is, it does not have separate sets of uppercase and lowercase letters).
The shapes of the letters are not arbitrary, but reflect the names for the letters, which are words, usually the names of objects or actions representing conventionalized form in the pictorial shape of the characters.
Norman Zide, A portal for Santals[1]
History
The Ol Chiki script was created in 1925 by Raghunath Murmu for the Santali language, and was first publicized in 1939 at a Mayurbhanj State exhibition.[2] Unlike most Indic scripts, Ol Chiki is not an abugida but a true alphabet, giving the vowels equal representation with the consonants.

Before the invention of the Ol Chiki script, Santali was written in the Bangla, Devanagari, Kalinga, and Latin scripts. However, Santali is not an Indo-Aryan language, and the Indic scripts did not have letters for all of Santali's phonemes, especially its stop consonants and vowels, which makes it difficult to write the language accurately in an unmodified Indic script.
For example, when the Norwegian missionary and linguist Paul Olaf Bodding studied the Santali language and needed to decide how to transcribe it (in producing his widely followed and widely respected reference books, such as A Santal Dictionary), he decided to transcribe Santali in the Roman alphabet. Despite his observation that Roman script lacks many of the advantages of the Indic scripts, he concluded that the Indic scripts could not adequately serve the Santali language because they lack a way to indicate important features of Santali pronunciation (such as glottalization, combined glottalization and nasalization, and checked plosives), which can be more easily represented in the Roman alphabet through the use of diacritics.[3]
The phonology of the Santali language has been similarly analyzed by various other authors, including Byomkes Chakrabarti in Comparative Study of Santali and Bengali and Baghrai Charan Hembram in A Glimpse of Santali Grammar. However, many Santali consider the Ol Chiki alphabet even more appropriate for the language, because its letter shapes are derived from the sounds of common Santali words and other frequent Santali morphemes:[a] nouns, demonstratives, adjectives, and verb roots in the Santali language.[4] In other words, each Santali letter's name is, or is derived from, a common word or other element of the Santali language, and each letter's shape is derived from a simple drawing of the meaning of that word or other element. For example, the Santali letter “ol” (representing the sound /l/) is written with a shape originally derived from a simplified outline drawing of a hand holding a pen, because the name of this letter is also the Santali word for “writing.”
Print and cursive styles
The existence of these two styles of Ol Chiki was mentioned by the script's creator, Guru Gonke Pandit Raghunath Murmu (also known as Pandit Murmu), in his book Ol Chemed,[5] which explains and teaches the Ol Chiki script.[b] Chhapa (Santali for 'print') is used for publication, while usaraà (Santali for 'quick') is used for handwriting.
Chhapa hand
Ol Chiki chhapa, or print style, is the more common style for digital fonts, and is used in the printing of books and newspapers.
Usaraà hand
Usaraà or usaraà ol is the cursive style, and is largely limited to pen and paper, though there are digital usaraà typefaces. Differences include the diacritic ahad, which in print style is used with ᱜ, ᱡ, ᱦ, ᱫ, and ᱵ, all of which can form ligatures with ᱽ in cursive.[6] Further, cursive usaraà seldom uses several letter-shapes which are formed by combining the letter ᱦ and the four semi-consonants ᱜ, ᱡ, ᱫ, and ᱵ with ahad; instead, these are generally written in a shorter form, as ᱷ.
Letters
The values of the Ol Chiki letters are as follows (the ALA-LC, Zide, Devanagari, Bengali, and Odia columns give transliterations):
| Letter | Name | IPA[7] | ALA-LC[8] | Zide[7] | Deva.[6] | Beng.[6] | Odia[6] | Shape[1] |
|---|---|---|---|---|---|---|---|---|
| ᱚ | la | /ɔ/ | a | ọ | अ | অ | ଅ | burning fire |
| ᱟ | laa | /a/ | ā | a | आ | আ | ଆ | working in the field with a spade |
| ᱤ | li | /i/ | i | i | इ | ই | ଇ | bending tree |
| ᱩ | lu | /u/ | u | u | उ | উ | ଉ | vessel used for preparing food |
| ᱮ | le | /e/ | e | e | ए | এ | ଏ | overflowing rivers changing course |
| ᱳ | lo | /o/ | o | o | ओ | ও | ଓ | a mouth when sounding this letter |
| ᱶ | ov | /w̃/ | ṅ | w̃ | ङ | ঙ | ଙ୍ | nasalized |
| ᱠ | aak | /k/ | k | k | क | ক | କ୍ | bird (sound of a swan) |
| ᱜ | ag | /kʼ/, /g/ | g | kʼ | ग | গ | ଗ୍ | vomiting mouth, which produces the same sound as the name of the letter |
| ᱝ | ang | /ŋ/ | ṃ | ṅ | ं | ং | ଂ | blowing air |
| ᱪ | uch | /c/ | c | c | च | চ | ଚ୍ | peak of a mountain which is usually high |
| ᱡ | aaj | /cʼ/, /ɟ/ | j | cʼ | ज | জ | ଜ୍ | person pointing towards a third person with the right hand (saying “he”) |
| ᱧ | iny | /ɲ/ | ñ | ñ | ञ | ঞ | ଞ୍ | person pointing towards himself/herself with the left hand |
| ᱴ | ott | /ʈ/ | ṭ | ṭ | ट | ট | ଟ୍ | camel hump |
| ᱰ | edd | /ɖ/ | ḍ | ḍ | ड | ড | ଡ୍ | a man with two legs stretching towards his chest and mouth |
| ᱬ | unn | /ɳ/ | ṇ | ṇ | ण | ণ | ଣ୍ | picture of a flying bee (which is described by Santali speakers as making this sound) |
| ᱛ | at | /t/ | t | t | त | ত | ତ୍ | the Earth |
| ᱫ | ud | /tʼ/, /d/ | d | tʼ | द | দ | ଦ୍ | mushroom or umbrella |
| ᱱ | en | /n/ | n | n | न | ন | ନ୍ | threshing grains with two legs |
| ᱯ | ep | /p/ | p | p | प | প | ପ୍ | person receiving with both hands |
| ᱵ | ob | /pʼ/, /b/ | b | pʼ | ब | ব | ବ୍ | curly hair |
| ᱢ | aam | /m/ | m | m | म | ম | ମ୍ | person pointing towards a second person with the left hand (saying “you”) |
| ᱭ | uy | /j/ | y | y | य | য় | ୟ୍ | a man bending towards the ground to cut something |
| ᱞ | al | /l/ | l | l | ल | ল | ଲ୍ | writing |
| ᱨ | ir | /r/ | r | r | र | র | ର୍ | sickle used for cutting or reaping |
| ᱣ | aaw | /w/, /v/ | w | w | व | ওয় | ୱ୍ | opening lips |
| ᱥ | is | /s/ | s | s | स | স | ସ୍ | plow |
| ᱦ | ih | /ʔ/, /h/ | ẖ | h | ह | হ | ହ୍ | hands up |
| ᱲ | err | /ɽ/ | ṛ | ṛ | ड़ | ড় | ଡ଼୍ | a path that turns to avoid an obstruction or a danger |
| ᱷ | oh | /ʰ/ | h | (C)h | ह | হ | ହ୍ | a man throwing something with one hand |
Aspirated consonants are written as digraphs with the letter ᱷ:[9][6] ᱛᱷ /tʰ/, ᱜᱷ /gʱ/, ᱠᱷ /kʰ/, ᱡᱷ /jʱ/, ᱪᱷ /cʰ/, ᱫᱷ /dʱ/, ᱯᱷ /pʰ/, ᱰᱷ /ɖʱ/, ᱲᱷ /ɽʱ/, ᱴᱷ /ʈʰ/, and ᱵᱷ /bʱ/.
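The digraph rule can be sketched in Python. This is a minimal illustration, not a full tokenizer: the helper name `split_units` is hypothetical, and it assumes that ᱷ (U+1C77) directly after one of the eleven consonants listed above always marks aspiration.

```python
# Consonants that the text lists as forming aspirated digraphs with ᱷ.
ASPIRABLE = set("ᱛᱜᱠᱡᱪᱫᱯᱰᱲᱴᱵ")
OH = "\u1C77"  # OL CHIKI LETTER OH, written after the consonant it aspirates


def split_units(word: str) -> list[str]:
    """Split a word into letters, keeping consonant + ᱷ digraphs together.

    Assumption (not from the source): every ᱷ that directly follows an
    aspirable consonant belongs to that consonant.
    """
    units = []
    i = 0
    while i < len(word):
        is_digraph = (
            word[i] in ASPIRABLE
            and i + 1 < len(word)
            and word[i + 1] == OH
        )
        step = 2 if is_digraph else 1
        units.append(word[i:i + step])
        i += step
    return units
```

For example, `split_units("ᱠᱷᱟ")` groups the first two code points into one unit, while `split_units("ᱠᱟ")` returns two single letters.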
Other marks
Ol Chiki employs several marks which are placed after the letter they modify (there are no combining characters):
| Mark | Name | Description |
|---|---|---|
| ᱹ | găhlă ṭuḍăg | This baseline dot is used to extend three vowel letters for the Santal Parganas dialect of Santali:[9] ᱚᱹ ŏ /ɔ/, ᱟᱹ ă /ə/, and ᱮᱹ ĕ /ɛ/. The phonetic difference between ᱚ and ᱚᱹ is not clearly defined and there may be only a marginal phonemic difference between the two. ᱚᱹ is rarely used. ALA-LC transliterates ᱟᱹ as "ạ̄".[8] |
| ᱸ | mũ ṭuḍăg | This raised dot indicates nasalization of the preceding vowel: ᱚᱸ /ɔ̃/, ᱟᱸ /ã/, ᱤᱸ /ĩ/, ᱩᱸ /ũ/, ᱮᱸ /ẽ/, and ᱳᱸ /õ/. ALA-LC transliteration uses "m̐" after the affected vowel.[8] |
| ᱺ | mũ găhlă ṭuḍăg | This colon-like mark is used to mark a nasalized extended vowel. It is a combination of mũ ṭuḍăg and găhlă ṭuḍăg: ᱚᱺ /ɔ̃/, ᱟᱺ /ə̃/, and ᱮᱺ /ɛ̃/. |
| ᱻ | relā | This tilde-like mark indicates the prolongation of any oral or nasalized vowel. Compare ᱮ /e/ with ᱮᱻ /eː/. It comes after the găhlă ṭuḍăg for extended vowels: ᱮᱹᱻ /ɛː/. It is omitted in ALA-LC transliteration.[8] |
| ᱽ | ahad | This special letter indicates the deglottalization of a consonant in the word-final position. It preserves the morphophonemic relationship between the glottalized (ejective) and voiced equivalents of consonants.[9] For example, ᱜ represents a voiced /g/ when word initial but an ejective /kʼ/ when in the word-final position. A voiced /g/ in the word-final position is written as ᱜᱽ. The ahad is used with ᱜ, ᱡ, ᱦ, ᱫ, and ᱵ which can form cursive ligatures with ᱽ in handwriting (but not usually in printed text).[6] ALA-LC transliteration uses an apostrophe (ʼ) to represent an ahad.[8] |
| ᱼ | phārkā | This hyphen-like mark serves as a glottal protector (the opposite function to that of the ahad). It preserves the ejective sound, even in the word-initial position. Compare ᱜᱚ /gɔ/ with ᱜᱼᱚ /kʼɔ/. The phārkā is only used with ᱜ, ᱡ, ᱫ, and ᱵ. It is omitted in ALA-LC transliteration.[8] |
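The interplay of ahad and phārkā described in the table can be expressed as a small decision rule. The Python sketch below is a simplification under stated assumptions: the function name `stop_reading` is hypothetical, the letter ᱦ is left out, and an unmarked consonant in word-medial position is treated like a word-initial one, which the text above does not confirm.

```python
AHAD = "\u1C7D"      # deglottalization mark
PHAARKAA = "\u1C7C"  # glottal protector

# (voiced, ejective) readings taken from the IPA column of the letter table.
READINGS = {
    "\u1C5C": ("g", "kʼ"),  # ᱜ
    "\u1C61": ("ɟ", "cʼ"),  # ᱡ
    "\u1C6B": ("d", "tʼ"),  # ᱫ
    "\u1C75": ("b", "pʼ"),  # ᱵ
}


def stop_reading(word: str, i: int) -> str:
    """Return the reading of the semi-consonant at index i of word."""
    voiced, ejective = READINGS[word[i]]
    following = word[i + 1] if i + 1 < len(word) else ""
    if following == AHAD:
        return voiced      # ahad forces the voiced reading
    if following == PHAARKAA:
        return ejective    # phaarkaa preserves the ejective reading
    # Unmarked: ejective at the end of the word, voiced elsewhere.
    # Treating medial position as voiced is an assumption of this sketch.
    return ejective if following == "" else voiced
```

With this rule, `stop_reading("ᱜᱚ", 0)` gives the voiced reading and `stop_reading("ᱜᱼᱚ", 0)` the ejective one, matching the two examples in the phārkā row.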
Numerals
Ol Chiki has its own numerals:
| Hindu-Arabic numerals | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
|---|---|---|---|---|---|---|---|---|---|---|
| Ol Chiki | ᱐ | ᱑ | ᱒ | ᱓ | ᱔ | ᱕ | ᱖ | ᱗ | ᱘ | ᱙ |
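Because the ten digits sit at consecutive code points (U+1C50 to U+1C59 in the Unicode chart), converting between the two digit sets is a fixed offset. A minimal Python sketch, with hypothetical helper names:

```python
ASCII_DIGITS = "0123456789"
# The Ol Chiki digits 0-9 occupy U+1C50..U+1C59 in ascending order.
OL_CHIKI_DIGITS = "".join(chr(0x1C50 + n) for n in range(10))

TO_OL_CHIKI = str.maketrans(ASCII_DIGITS, OL_CHIKI_DIGITS)
TO_ASCII = str.maketrans(OL_CHIKI_DIGITS, ASCII_DIGITS)


def to_ol_chiki_digits(text: str) -> str:
    """Replace every ASCII digit in text with its Ol Chiki counterpart."""
    return text.translate(TO_OL_CHIKI)


def to_ascii_digits(text: str) -> str:
    """Replace every Ol Chiki digit in text with its ASCII counterpart."""
    return text.translate(TO_ASCII)
```

Characters other than digits pass through unchanged, so the helpers can be applied to whole sentences. Python's built-in `int()` also accepts Ol Chiki digits directly, since they are Unicode decimal digits.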
Punctuation
Some Western-style punctuation marks are used with Ol Chiki: the comma (,), exclamation mark (!), question mark (?), and quotation marks (“ and ”).
The period/full stop (.) is not used, because it is visually confusable with the găhlă ṭuḍăg mark (ᱹ);[6] instead, the script uses single or double Ol Chiki short dandas:
- ᱾ (mucăd) marks a minor break
- ᱿ (double mucăd) marks a major break
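Since the two dandas take the place of the full stop, text can be segmented by splitting on them. A minimal Python sketch (the function name `split_on_breaks` is hypothetical, and it ignores the question and exclamation marks that may also end a sentence):

```python
import re

MUCAAD = "\u1C7E"         # ᱾, minor break
DOUBLE_MUCAAD = "\u1C7F"  # ᱿, major break

BREAK_PATTERN = re.compile(f"[{MUCAAD}{DOUBLE_MUCAAD}]")


def split_on_breaks(text: str) -> list[str]:
    """Split text at single and double mucaad, dropping empty segments."""
    return [part.strip() for part in BREAK_PATTERN.split(text) if part.strip()]
```

Splitting on the ASCII period instead would be unreliable for Ol Chiki text for the reason given above: the period is not part of the script's punctuation.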
Computing
Unicode
Ol Chiki script was added to the Unicode Standard in April 2008 with the release of version 5.1.
The Unicode block for Ol Chiki is U+1C50–U+1C7F:
Ol Chiki: Official Unicode Consortium code chart (PDF)
| | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| U+1C5x | ᱐ | ᱑ | ᱒ | ᱓ | ᱔ | ᱕ | ᱖ | ᱗ | ᱘ | ᱙ | ᱚ | ᱛ | ᱜ | ᱝ | ᱞ | ᱟ |
| U+1C6x | ᱠ | ᱡ | ᱢ | ᱣ | ᱤ | ᱥ | ᱦ | ᱧ | ᱨ | ᱩ | ᱪ | ᱫ | ᱬ | ᱭ | ᱮ | ᱯ |
| U+1C7x | ᱰ | ᱱ | ᱲ | ᱳ | ᱴ | ᱵ | ᱶ | ᱷ | ᱸ | ᱹ | ᱺ | ᱻ | ᱼ | ᱽ | ᱾ | ᱿ |
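The chart falls into four contiguous runs: digits, letters, the marks described under "Other marks", and the two dandas. A minimal Python sketch that classifies a character by code point; the function name and the category labels are illustrative choices, not official Unicode property names:

```python
def classify_ol_chiki(ch: str) -> str:
    """Classify a single character by its position in the Ol Chiki block."""
    cp = ord(ch)
    if 0x1C50 <= cp <= 0x1C59:
        return "digit"        # ᱐ to ᱙
    if 0x1C5A <= cp <= 0x1C77:
        return "letter"       # ᱚ to ᱷ, the 30 letters
    if 0x1C78 <= cp <= 0x1C7D:
        return "modifier"     # ᱸ ᱹ ᱺ ᱻ ᱼ ᱽ
    if 0x1C7E <= cp <= 0x1C7F:
        return "punctuation"  # ᱾ and ᱿
    return "other"            # outside U+1C50..U+1C7F


def is_ol_chiki(text: str) -> bool:
    """True if every non-space character of text lies in the Ol Chiki block."""
    return all(classify_ol_chiki(c) != "other" for c in text if not c.isspace())
```

A check such as `is_ol_chiki("ᱚᱞ ᱪᱤᱠᱤ")` can be used to detect whether a string is written purely in the script before applying script-specific processing.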
Fonts
- Google's Noto Sans Ol Chiki.[10]
- Microsoft's font family Nirmala UI.[11]
Mixing the two letter styles
Although Ol Chiki (Chapa) and Ol Chiki (Usara) are not normally mixed, and the script's inventor never mentioned mixing these letter styles, some works do mix both forms, using them like English capital and small letters. However, this innovation has yet to be accepted officially.[12]
The invention of a lower case for Ol Chiki
Since 2017, Santali graphic designer, typographer, and film producer Sudip Iglesias Murmu has been working on design principles to provide a lowercase alphabet form for Ol Chiki, which would permit Ol Chiki writing and keyboarding to use a two-case, or bicameral, format (using both uppercase and lowercase), as is done in many other written languages, including Roman-alphabet languages such as English (all of which were once written in unicameral scripts but evolved into a bicameral stage over time). Because the development of a lowercase form tends to contribute to a standardized cursive form (in those writing systems which use one), the evolution of lowercase is likely to allow cursive to be standardized to the point of making it typeable alongside the more rigid "block" printed letterforms. So far, only Ol Chiki (Chapa) letters are used in keyboarding, typesetting, and publishing (in effect, producing capitals-only text throughout all printed or keyboarded documents). In writing quickly by hand, Ol Chiki (Usara) is used; but despite Ol Usara's potential for reaching high speed, the circulation of Ol Usara documents is negligible, and Ol Usara has yet to receive Unicode standardization, leaving it neglected.
In hopes of remedying this situation and harmonizing the two styles, Sudip Iglesias Murmu has created a series of lowercase letters, which he has integrated with an existing Ol Chiki font. According to him, providing lowercase letters increases the efficiency of keyboarding, both for Ol Chiki (Chapa) and for Ol Chiki (Usara), and allows keyboarding to reach the same speed that can be obtained when typing Santali in Roman-alphabet letters, which are likewise case-sensitive. However, his work has yet to be accepted officially.[13]
See also
- Byomkes Chakrabarti (a Bengali research worker on ethnic languages)
- Santali Latin alphabet
- Turia Chand Baskey
References
1. "Ol Chiki Script". A portal for Santals. 2002. Archived from the original on 8 August 2018. Retrieved 12 September 2017.
2. Hembram, Phatik Chandra (2002). Santhali, a Natural Language. U. Hembram. p. 165.
3. Hembram, Baghrai Charan (2012). A Glimpse of Santali Grammar. Noha Trust Bahalda Mayurbhanj odisha. p. 05. Archived from the original on 30 September 2022. Retrieved 23 June 2022.
4. Hembram, Baghrai Charan (2012). A Glimpse of Santali Grammar. Noha Trust Bahalda Mayurbhanj odisha. p. 01.
5. Samal, A. P. Subhakanta (14 January 2022). PERSONALITIES OF ODISHA. Shubhdristi Publication. ISBN 978-93-5593-204-4. Archived from the original on 26 September 2023. Retrieved 22 June 2022.
6. Everson, Michael (5 September 2005). "L2/05-243R: Final proposal to encode the Ol Chiki script in the UCS" (PDF). Archived (PDF) from the original on 31 July 2015. Retrieved 19 September 2019.
7. Zide, Norman (1996). Daniels, Peter T.; Bright, William (eds.). The World's Writing Systems. Oxford University Press, Inc. pp. 614-615. ISBN 978-0195079937.
8. "Santali (in Ol script)" (PDF). ALA-LC Romanization Tables. Library of Congress. Archived (PDF) from the original on 4 December 2018. Retrieved 12 September 2017.
9. "The Unicode Standard, Chapter 13.10: Ol Chiki" (PDF). Unicode Consortium. March 2020. Archived (PDF) from the original on 5 July 2020. Retrieved 21 March 2020.
10. "Noto Sans Ol Chiki". Google Noto Fonts. Archived from the original on 8 January 2017. Retrieved 5 June 2020.
11. "Nirmala UI font family - Typography". docs.microsoft.com. Retrieved 5 June 2020.
12. Ajanta HC (11 August 2021). "Ol Chiki Lower Case Letters Invented by Sudip Iglesias Murmu". PRLog (Press release). Mumbai. Archived from the original on 22 November 2021. Retrieved 2 June 2022.
13. "Ol Chiki moulded into perfection by invention of lowercase letters - Ajanta Heritage & Culture". 9 August 2021. Archived from the original on 26 May 2022. Retrieved 2 June 2022.
Ol Chiki script
History
Invention and early development
The Santali language, spoken by millions across eastern India, historically lacked a dedicated writing system and relied on non-native scripts such as Devanagari, Bengali-Assamese, Odia, Latin, and the earlier Ol Nagri script.[10][11] These scripts proved inadequate for capturing the unique phonological features of Santali, including its distinct consonants and vowels, which led to inconsistencies in representation and hindered the language's literary and cultural expression.[12][11]
Raghunath Murmu, born on May 5, 1905, in Dandbese village, Mayurbhanj district (now in Odisha), was a Santali writer, educator, and cultural activist who recognized these limitations during his early life and teaching career.[6][10] Inspired by the natural shapes observed in his surroundings and the phonetic needs of Santali, Murmu began developing a new script around 1920–1925, aiming to create a system that reflected the language's sounds while drawing from Santali cultural motifs.[6] He finalized the Ol Chiki script in 1925, designing it with 30 letters for precise phonetic accuracy, written from left to right without conjunct forms to simplify learning and usage, and incorporating symbols evocative of Santali heritage, such as forms resembling arrows, birds, and other natural elements.[10][11]
The script received its first public demonstration in 1939 at the Mayurbhanj State exhibition, where Murmu showcased its potential through printed materials and explanations.[10] Early publications in Ol Chiki date from the same period: Murmu's Horh Sereng, a collection of songs, appeared in 1936 as one of the initial printed works, followed by his play Bidu Chandan in 1942, and both helped propagate the script within Santali communities during the 1930s and 1940s.[6] These efforts marked the script's nascent phase, emphasizing its role in fostering Santali literacy independent of external scripts.[6]
Adoption and official recognition
Following India's independence in 1947, efforts to promote the Ol Chiki script gained momentum through grassroots movements in Santal communities, particularly the Ol Chiki movement in Mayurbhanj district, Odisha, during the 1970s, where local educators and activists organized workshops to teach the script and resist the dominance of borrowed scripts like Devanagari and Bengali used for Santali.[13][14] This period saw the formation of script committees, such as those under the All India Santali Writers' Association, which advocated for Ol Chiki's standardization amid opposition from proponents of alternative scripts, including Romanized forms favored by Christian missionaries.[15] By the late 1970s, the Government of West Bengal had approved Ol Chiki as the official script for Santali primary education, marking an early institutional push despite logistical hurdles like limited printing resources.[16]
Key milestones in official recognition followed in the early 2000s, with West Bengal providing full acceptance for official purposes, including education, in 2001 after sustained advocacy.[17] The inclusion of Santali in the Eighth Schedule of the Indian Constitution in 2003 further elevated the language's status, implicitly supporting Ol Chiki as its primary script and facilitating federal funding for its promotion across states like West Bengal, Bihar, and Assam.[18] This led to efforts to integrate the script into education in states such as Jharkhand and Odisha, though adoption varied, with experimental programs in Odisha facing challenges and other scripts remaining prevalent in Jharkhand.[19]
Educational integration accelerated from the 1990s, with Ol Chiki introduced in primary schools in Odisha and West Bengal, where Santali-medium classes using the script began to address linguistic barriers for Adivasi students.[20] From 1983 to 1992, over 59,600 copies of Ol Chiki textbooks were published and distributed for primary education in West Bengal, covering subjects up to Class III (and mathematics up to Class IV), supporting the growth of Santali literature to include hundreds of titles in genres like novels, poetry, and folklore.[16] This expansion fostered cultural preservation, though challenges persisted, including shortages of trained teachers and interruptions in textbook supply due to funding issues in the mid-1990s.[17]
Recent developments from 2020 to 2025 have emphasized digital adoption, accelerated by the COVID-19 pandemic, with the script's Unicode support enabling online Santali resources for remote education in Jharkhand and Odisha.[21] Government programs, including teacher training workshops in Ol Chiki pedagogy, have been rolled out in states like West Bengal to bridge digital divides, alongside the 2022 translation of India's Constitution into the script to promote accessibility.[18] In December 2024, the Odisha government announced plans to provide education in Santali using the Ol Chiki script from kindergarten to postgraduate levels. In August 2025, Assam introduced Santali lessons in the Ol Chiki script across schools. The year 2025 also marks the 100th anniversary of the script's invention, with initiatives like 'Mission Ol-Chiki 2025' promoting its use.[22][23] Persistent challenges include the historical scarcity of typewriters for Ol Chiki until the 1990s, which delayed printing, and ongoing competition from Romanized Santali in online spaces, where transliteration tools dominate because they are easier to use on non-specialized devices.[24]
Script design and characteristics
Phonetic principles
The Ol Chiki script functions as a phonemic alphabet specifically designed to capture the sounds of the Santali language, diverging from the syllabic nature of traditional Brahmic scripts like Devanagari.[2][25] Created to provide an accurate and unambiguous representation of Santali phonemes, it assigns a unique symbol to each distinct sound, ensuring that the script reflects the language's phonological inventory without the ambiguities introduced by adapting non-native scripts.[4][25]
Santali phonology, as encoded in Ol Chiki, features six basic vowels (typically /a/, /i/, /u/, /e/, /o/, and /ɔ/) along with 24 consonants that include retroflex sounds, aspirated stops, glottalized consonants, and a glottal stop, many of which lack direct equivalents in Devanagari.[26][25] These consonants encompass a range of articulatory positions, such as labials, dentals, retroflexes, and velars, with additional distinctions for unreleased stops and voiceless /h/.[4] The script accommodates dialectal variations, such as the Orissa Mayurbhanj dialect's six vowels, while using modifiers to represent nasalization, length, and additional vowel qualities in other dialects that may have up to nine vowels.[2][25]
The core structure of Ol Chiki is a true alphabetic system in which vowels are indicated by dedicated letters, appearing independently, before, or after consonants. There is no inherent vowel; standalone consonants may be realized with an echo vowel approximating a nearby vowel or /ə/.[4] Syllables are formed by combining a consonant (or cluster) with a following vowel letter or digraph, maintaining a linear sequence without reordering or special forms for consonant clusters.[4] The script's 30 basic letters are organized in a 6×5 matrix, with the first column dedicated to vowels and the remaining columns grouping consonants by articulatory features, such as place and manner of articulation, to facilitate phonetic learning and recall.[26][25]
Key innovations in Ol Chiki include the absence of ligatures or conjunct characters, unlike Brahmic scripts that require complex combinations for clusters, allowing for straightforward rendering of Santali's 24 consonants and avoiding representational gaps for sounds like glottalized stops.[2][4] Instead of vertical stacking, the script places vowel letters, digraphs, and modifier marks in a horizontal sequence; the ahad mark, for example, signals deglottalization and so links related phonemes like /kʼ/ and /g/.[25] Written from left to right in horizontal lines, it promotes clarity in syllable formation, typically consonant-vowel (CV) or consonant-vowel-consonant (CVC), and supports the language's five diphthongs through precise phoneme mapping, ensuring no loss of phonetic nuance.[4][2]
Letter inventory
The Ol Chiki alphabet comprises 30 core letters, consisting of 24 consonants and 6 independent vowels, each representing a single phoneme in the Santali language. This alphabetic structure ensures vowels have equal status to consonants, unlike abugidas common in other Indian scripts. The letters are arranged in a traditional 6×5 matrix, with vowels in the first column and consonants filling the remaining positions, reflecting the script's systematic design.[4][5]
The six vowel letters are written as full letters in every position, whether at the start of a syllable, after a consonant, or in isolation; the modifier marks (covered separately) follow the letter they modify. Because the script is unicameral, each letter has a single basic print form. These forms, along with transliterations and International Phonetic Alphabet (IPA) values, are as follows:
| Glyph | Name | Transliteration | IPA |
|---|---|---|---|
| ᱚ | LA | a | /ɔ/ |
| ᱟ | LAA | ā | /a/ |
| ᱤ | LI | i | /i/ |
| ᱩ | LU | u | /u/ |
| ᱮ | LE | e | /e/ |
| ᱳ | LO | o | /o/ |

Labial consonants:

| Glyph | Name | Transliteration | IPA |
|---|---|---|---|
| ᱯ | EP | p | /p/ |
| ᱵ | OB | b | /p̚, b/ |
| ᱢ | AAM | m | /m/ |
| ᱣ | AAW | w | /w, v/ |
| ᱶ | OV | v | /ʋ/ |

Dental and alveolar consonants:

| Glyph | Name | Transliteration | IPA |
|---|---|---|---|
| ᱛ | AT | t | /t/ |
| ᱫ | UD | d | /t̚, d/ |
| ᱥ | IS | s | /s/ |
| ᱱ | EN | n | /n/ |
| ᱞ | AL | l | /l/ |
| ᱨ | IR | r | /r/ |

Palatal consonants:

| Glyph | Name | Transliteration | IPA |
|---|---|---|---|
| ᱪ | UC | c | /t͡ɕ/ |
| ᱡ | AAJ | j | /ɟ, t͡ɕ̚/ |
| ᱧ | INY | ñ | /ɲ/ |
| ᱭ | UY | y | /j/ |

Retroflex consonants:

| Glyph | Name | Transliteration | IPA |
|---|---|---|---|
| ᱴ | OTT | ṭ | /ʈ/ |
| ᱰ | EDD | ḍ | /ɖ/ |
| ᱬ | UNN | ṇ | /ɳ/ |
| ᱲ | ERR | ṛ | /ɽ/ |

Velar consonants:

| Glyph | Name | Transliteration | IPA |
|---|---|---|---|
| ᱠ | AAK | k | /k/ |
| ᱜ | AG | g | /ɡ, k̚/ |
| ᱝ | ANG | ṃ | /ŋ/ |

Glottal consonants:

| Glyph | Name | Transliteration | IPA |
|---|---|---|---|
| ᱦ | IH | h | /ʔ, h/ |
| ᱷ | OH | h | /h/ |
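
The 6×5 arrangement can be reproduced from code point order. The Python sketch below rests on one assumption that is not stated in the text: that the 30 letters at U+1C5A to U+1C77 are encoded row by row in the traditional matrix order. The check on the first column is consistent with that assumption but does not prove it; the names `matrix` and `VOWELS` are illustrative.

```python
FIRST_LETTER = 0x1C5A  # ᱚ, the first of the 30 letters
ROWS, COLS = 6, 5

# The 30 letters in code point order.
letters = [chr(FIRST_LETTER + i) for i in range(ROWS * COLS)]

# Cut the sequence into six rows of five letters each.
matrix = [letters[r * COLS:(r + 1) * COLS] for r in range(ROWS)]

# The six vowel letters, as listed in the vowel table.
VOWELS = [
    "\u1C5A",  # ᱚ LA
    "\u1C5F",  # ᱟ LAA
    "\u1C64",  # ᱤ LI
    "\u1C69",  # ᱩ LU
    "\u1C6E",  # ᱮ LE
    "\u1C73",  # ᱳ LO
]

first_column = [row[0] for row in matrix]

if __name__ == "__main__":
    for row in matrix:
        print(" ".join(row))
    print("first column is the vowels:", first_column == VOWELS)
```

Under this layout every fifth code point, starting from U+1C5A, is a vowel, which matches the description of vowels occupying the first column.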
Graphical styles
Print style (Chhapa hand)
The print style of the Ol Chiki script, known as Chhapa hand, takes its name from the Santali term chhapa meaning "print," and is characterized by block-like, angular forms optimized for clarity in typesetting and formal publications.[27] Developed as the primary variant by inventor Raghunath Murmu in 1925, Chhapa employs straight lines and geometric shapes inspired by natural and cultural elements familiar to the Santali community, such as tools and environmental forms, to create distinct, non-joining glyphs.[27][28] This design ensures uniform height across letters, with no serifs or curves that could complicate reproduction, making it ideal for mechanical printing processes. The script is unicameral, lacking a distinction between uppercase and lowercase forms, which contributes to its consistent glyph appearance.[27]
Key features of Chhapa include its alphabetic structure adapted for Santali phonetics, where each of the 30 core letters represents a consonant or vowel sound, often followed by modifier marks for nasalization or length, all rendered in a rigid, non-cursive manner.[27] For example, the letter ᱛ (for the sound /t/) adopts a form resembling the shape of the Earth, while ᱚ (for /ɔ/) uses a simple looped geometry evoking burning fire.[27][10][29] These angular constructions prioritize legibility over fluidity, distinguishing Chhapa from the more flowing Usaraà hand used in handwriting.[27]
Historically, Chhapa has been the dominant form since the script's early adoption, appearing in the first Ol Chiki book, Horh Sereng (Song of the Sun), published in 1936, and subsequent works by Murmu, who produced over 150 titles in the script by the mid-20th century.[27][10] Its use expanded in the 1940s through printed materials in Santali communities across eastern India, including educational texts and cultural documents, and it remains standard in modern newspapers and books for its compatibility with metal type and offset printing.[27][28]
The advantages of Chhapa lie in its simplicity and robustness, facilitating easy readability at a distance and efficient production on early printing presses, which helped promote Santali literacy despite limited resources.[27] This style's geometric precision also supports seamless digital rendering today, though its original intent was to empower printed dissemination of Santali literature and identity.[27]
| Letter | Print Form (Chhapa) | Sound | Inspiration |
|---|---|---|---|
| ᱚ | ᱚ | /ɔ/ | Shape of burning fire |
| ᱛ | ᱛ | /t/ | Shape of the Earth |
| ᱥ | ᱥ | /s/ | Plough form |
Cursive style (Usaraà hand)
The cursive style of the Ol Chiki script, known as Usaraà hand or Usara ol, refers to the flowing, connected form designed for rapid handwriting in the Santali language.[30] In Santali, "Usara" translates to "quick" or "running," emphasizing its purpose for efficient personal writing, while "ol" denotes writing, distinguishing it from the more rigid print style.[25] This variant maintains the script's core 30-letter inventory (six vowels and 24 consonants) but adapts the letters with rounded, interconnected strokes to facilitate speed and natural flow.[30]
Key features of Usaraà hand include curved lines that join letters within words through ligatures, allowing for seamless transitions between characters, and variations in slant and loop sizes to accommodate individual handwriting habits.[25] These elements enable the formation of fluid word shapes, such as the connected rendering of "baha" (ᱵᱟᱦᱟ), where the initial ᱵ flows into ᱟ via a gentle curve, followed by ᱦ and ᱟ linked by a descending loop, contrasting the discrete forms in print.[31] The style also incorporates diacritics like the gaahlaa ttuddaag for extended vowels, positioned dynamically to preserve readability in motion.[30]
Developed alongside the print style (Chhapa hand) by Pandit Raghunath Murmu in the 1920s, Usaraà hand emerged to support everyday documentation and artistic applications in Santali communities, gaining popularity in folk art, signage, and handwritten literature.[25] Its integration into Santali cultural expression, including poetry and novels, reflects Murmu's vision of a script rooted in natural shapes from the environment, such as the rounded ᱚ evoking fire or the looped ᱫ suggesting a mushroom.[25]
Among its advantages, Usaraà hand promotes economical handwriting by minimizing lifts of the pen, mirroring the natural rhythm of hand movement and reducing fatigue during extended writing sessions.[25] It is primarily taught in informal settings, such as community workshops and family traditions, fostering accessibility for non-formal learners in regions where Santali is spoken by over six million people across India, Bangladesh, and Nepal.[30] This style's emphasis on fluidity has also influenced modern font designs, like the Ol Chiki Usara handwriting variant, adapting traditional forms for contemporary use.[32]
Orthographic elements
Diacritics and other marks
The Ol Chiki script utilizes a set of modifier letters that serve as diacritics to convey phonological modifications, such as nasalization, vowel lengthening, and the creation of additional vowel qualities. Unlike combining diacritics in abugida scripts, these modifiers are independent spacing characters placed immediately after the base consonant or vowel letter they affect, ensuring clear syllable structure in this alphabetic system. There are six primary modifiers, all encoded in the Unicode block for Ol Chiki (U+1C50–U+1C7F).[5][4] The mu ttuddag (ᱸ, U+1C78) indicates nasalization of the preceding vowel, functioning similarly to an anusvara in other Indic scripts but as a distinct letter. It is applied post-vowel in syllables requiring nasal resonance, such as in Santali words where vowels assimilate to following nasals. The g aahl aa ttuddaag (ᱹ, U+1C79) modifies select base vowels to produce three additional vowel sounds, particularly for the Santal Parganas dialect variant; for instance, it transforms certain short vowels into mid or extended forms when appended. Their combination yields the mu-g aahl aa ttuddaag (ᱺ, U+1C7A), a precomposed mark for nasalized versions of these extended vowels, as seen in examples like ᱚᱺ (/ɔ̃ː/) or ᱟᱺ (/ə̃ː/).[5][26][2] Vowel lengthening is marked by the relaa (ᱻ, U+1C7B), positioned after the vowel to denote prolonged duration, which is phonemically significant in Santali for distinguishing meaning. The ph aark aa (ᱼ, U+1C7C) acts as a glottal protector or separator, preventing unintended glottalization in consonant clusters or marking brief glottal stops. Finally, the ahad (ᱽ, U+1C7D) deglottalizes consonants, removing any inherent or contextual glottal features to represent plain stops accurately. 
These modifiers attach only to letters (vowels or consonants) and do not apply to numerals or punctuation.[5][3] When more than one modification applies, the sequence follows a fixed order to avoid ambiguity: quality modifiers such as the gaahlaa ttuddaag precede nasalization (mu ttuddag). Since Ol Chiki lacks an inherent vowel, every syllable explicitly includes a vowel letter, with no matras or dependent forms; modifiers therefore refine explicit vowel letters rather than substitute for them. For example, the syllable "ba" (/ba/) is ᱵᱟ (letter b + a), nasalized "bã" is ᱵᱟᱸ, and the same syllable with modified vowel quality and added length is ᱵᱟᱹᱻ. This system supports the script's phonetic precision for Santali's six base vowels and derived forms.[26][4][3]
Numerals
The Ol Chiki script includes a dedicated set of ten native digits, encoded in Unicode as U+1C50 to U+1C59, representing the values 0 through 9. These digits are ᱐ (zero), ᱑ (one), ᱒ (two), ᱓ (three), ᱔ (four), ᱕ (five), ᱖ (six), ᱗ (seven), ᱘ (eight), and ᱙ (nine). Designed by Pandit Raghunath Murmu alongside the script's letters in 1925, the numerals feature angular lines and geometric forms that harmonize with the overall aesthetic of the Ol Chiki alphabet, which draws inspiration from natural shapes and everyday objects to evoke cultural resonance.[2][33]
The digits follow the standard decimal positional notation and are used in print (Chhapa hand) and basic cursive (Usaraà hand) variants, aligning with the script's dual stylistic forms for formal printing and faster handwriting. In print style, the digits maintain blocky, distinct outlines suitable for typography, while cursive variants adopt slanted, narrower proportions to facilitate fluid writing, though Unicode encoding treats them uniformly without glyph distinctions. These forms ensure readability in both contexts, supporting the script's left-to-right direction.[4][33]
Introduced concurrently with the script's invention, the numerals have been integral to Santali literature and documentation since the early 20th century, appearing in educational materials and publications from regions like Mayurbhanj district. In contemporary usage, they denote dates, quantities, and lists within Santali texts, such as enumerations on digital platforms like the Santali Wikipedia. For instance, the year 2025 is written as ᱒᱐᱒᱕. Ol Chiki digits are compatible with Latin numerals in mixed-language documents, allowing seamless integration in bilingual contexts without altering positional values.[2][4]
| Value | Ol Chiki Digit | Santali Name (example) |
|---|---|---|
| 0 | ᱐ | - |
| 1 | ᱑ | ᱢᱤᱫ (mit') |
| 2 | ᱒ | ᱵᱟᱨ (bar) |
| 3 | ᱓ | ᱯᱮ (pe) |
| 4 | ᱔ | ᱯᱚᱱ (pon) |
| 5 | ᱕ | ᱢᱚᱬᱮ (mɔ̃ṛɛ̃) |
| 6 | ᱖ | ᱛᱩᱨᱩᱭ (turui) |
| 7 | ᱗ | ᱮᱭᱟᱭ (eae) |
| 8 | ᱘ | ᱤᱨᱟᱹᱞ (irəl) |
| 9 | ᱙ | ᱟᱨᱮ (are) |
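
Since the ten digits sit contiguously at U+1C50 to U+1C59 and use ordinary decimal place value, converting between Western and Ol Chiki numerals is a matter of offsetting code points. The sketch below illustrates this in Python; the function names `to_ol_chiki_digits` and `from_ol_chiki_digits` are hypothetical helpers introduced here for illustration.

```python
OL_CHIKI_DIGIT_ZERO = 0x1C50  # OL CHIKI DIGIT ZERO; digits one to nine follow in order


def to_ol_chiki_digits(number):
    """Render a non-negative integer using Ol Chiki digits."""
    if number < 0:
        raise ValueError("only non-negative integers are handled in this sketch")
    # Map each ASCII decimal digit to the Ol Chiki digit with the same value.
    return "".join(chr(OL_CHIKI_DIGIT_ZERO + int(d)) for d in str(number))


def from_ol_chiki_digits(text):
    """Parse a string of Ol Chiki digits back into an integer."""
    # Python's int() accepts any Unicode decimal digits (general category Nd),
    # which includes the Ol Chiki digits, so no manual mapping is needed here.
    return int(text)


year = to_ol_chiki_digits(2025)
print(year)                        # the four digits for 2, 0, 2, 5
print(from_ol_chiki_digits(year))  # 2025
```

The round trip relies only on the digits being encoded in ascending order; it says nothing about how a particular font draws them in print or cursive style.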
Punctuation
The Ol Chiki script employs a combination of native punctuation marks and elements borrowed from the Latin script to indicate pauses, sentence boundaries, and other syntactic functions in Santali texts.[4] The core native marks are the mucaad (᱾) and double mucaad (᱿), which serve as delimiters analogous to the danda and double danda in Devanagari.[3] The mucaad, a single vertical bar-like glyph (U+1C7E), primarily marks the end of a sentence or a minor break, functioning similarly to a full stop; the Latin period (.) is avoided to prevent confusion with the gaahlaa ttuddaag diacritic (ᱹ).[5][4] The double mucaad (U+1C7F), consisting of two such bars, denotes a major break, such as at the end of a paragraph or section, or in poetic verse to indicate stanza divisions.[5][3]
For shorter pauses and clause separations, Ol Chiki adopts the Latin comma (,), which aligns with Santali's syntactic structure emphasizing post-positions and clause chaining without altering word order significantly.[4] Other borrowed Latin marks include the exclamation mark (!) for emphasis, the question mark (?), often used in a hybrid form combining Ol Chiki elements with the Latin curve for interrogative sentences, and quotation marks (“ ” and ‘ ’) for direct speech or citations.[4][3] Colon (:) and semicolon (;) are occasionally borrowed from Latin for lists or complex clauses, though their use remains less standardized in traditional Santali prose.[4] These conventions reflect an adaptation to Santali's agglutinative grammar, where punctuation highlights relational morphemes and discourse flow rather than strict Indo-European sentence isolation.[3]
Punctuation practices in Ol Chiki evolved from early 20th-century manuscripts by inventor Raghunath Murmu, who initially favored simple vertical lines akin to the Latin pipe (|), termed "muchad" in Santali.[2] Standardization followed with Unicode encoding (version 5.1, 2008), which introduced native glyphs for mucaad and double mucaad to ensure cultural distinctiveness in print and digital media.[5] In handwriting, variations persist, such as elongated or stylized mucaad forms in cursive Usaraà style, though print Chhapa style adheres closely to Unicode norms.[4]
Representative examples illustrate these roles. A simple declarative sentence might end with mucaad: ᱨᱮᱱ ᱵᱚᱸ ᱦᱚᱨ ᱟᱹᱜᱤᱫ ᱾ (Ren bôn hor ângìt mucaad, meaning "The sun rises."). For questions, the hybrid mark combines with Latin: ᱦᱚᱨ ᱟᱹᱜᱤᱫ ᱵᱚ? (Hor ângìt bo?, "Does the sun rise?"). In poetry, double mucaad separates stanzas: ᱨᱮᱱ ᱵᱚᱸ ᱦᱚᱨ ᱟᱹᱜᱤᱫ᱾ᱱᱚ ᱢᱮᱱᱟᱜ ᱞᱚᱠᱟᱜ ᱿ (Ren bôn hor ângìt; nomaeng lokâg double mucaad, "The sun rises; the bird flies.").[3][2]
Computing and digital use
Unicode encoding
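As a rough orientation to the block layout covered in this section, the sketch below classifies code points by the four sub-ranges of the Ol Chiki block and counts how many fall into each. The range boundaries follow the Unicode code chart; the `classify` function and its category labels are illustrative names, not part of any standard API.

```python
import unicodedata
from collections import Counter

# Sub-ranges of the Ol Chiki block (U+1C50 to U+1C7F), inclusive on both ends.
RANGES = [
    (0x1C50, 0x1C59, "digit"),
    (0x1C5A, 0x1C77, "letter"),
    (0x1C78, 0x1C7D, "modifier"),
    (0x1C7E, 0x1C7F, "punctuation"),
]


def classify(ch):
    """Return the sub-range label for an Ol Chiki character, else 'other'."""
    cp = ord(ch)
    for start, end, label in RANGES:
        if start <= cp <= end:
            return label
    return "other"


# Tally every code point in the block by sub-range.
counts = Counter(classify(chr(cp)) for cp in range(0x1C50, 0x1C80))
print(dict(counts))

# No character in the block has a canonical decomposition, so normalizing a
# sample string (the script's own name) returns it unchanged.
sample = "\u1C5A\u1C5E \u1C6A\u1C64\u1C60\u1C64"
print(unicodedata.normalize("NFC", sample) == sample)
```

The tally sums to the 48 code points of the block: 10 digits, 30 letters, 6 modifier letters, and 2 punctuation marks.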
The Ol Chiki Unicode block, designated U+1C50–U+1C7F, was added to the Unicode Standard in version 5.1.0, released in April 2008, providing 48 code points for the script's core elements. This allocation supports the Santali language's writing system by including dedicated ranges for digits, letters, modifier letters, and punctuation, all within the Basic Multilingual Plane.[5]
The block's specific assignments are as follows: digits occupy U+1C50 to U+1C59 (OL CHIKI DIGIT ZERO to OL CHIKI DIGIT NINE); letters span U+1C5A to U+1C77, encompassing 30 characters that include both vowels (e.g., U+1C5A OL CHIKI LETTER LA and U+1C5F OL CHIKI LETTER LAA) and consonants (e.g., U+1C60 OL CHIKI LETTER AAK); modifier letters for diacritics and other marks range from U+1C78 OL CHIKI MU TTUDDAG (nasalization) to U+1C7D OL CHIKI AHAD; and punctuation consists of U+1C7E OL CHIKI PUNCTUATION MUCAAD and U+1C7F OL CHIKI PUNCTUATION DOUBLE MUCAAD.[5] No substantive modifications have been made to the block since its introduction, with its structure preserved through subsequent versions, including Unicode 17.0 released in September 2025.[34][5]
Ol Chiki employs spacing characters throughout, with modifier letters functioning as non-combining marks placed directly after the base letters they modify, such as for vowel alteration or nasalization. Consequently, no diacritic stacking occurs, and because none of the block's characters has a canonical decomposition, normalization forms such as NFC leave Ol Chiki text unchanged. The encoding aligns fully with ISO/IEC 10646, facilitating international standardization.[4] Representative code point examples include U+1C60 for the consonant letter AAK (ᱠ) and U+1C78 for the mu ttuddag nasalization mark (ᱸ).[5]
Fonts and typography
The development of fonts for the Ol Chiki script has focused on supporting its unique letterforms, which draw inspiration from natural shapes, while addressing the needs of Santali language users for both print and digital typography. Major typefaces include Noto Sans Ol Chiki, a sans-serif design by Google with multiple weights (regular, medium, semibold, bold) and 55 glyphs, released under the SIL Open Font License and suitable for body text in unmodulated styles. Typotheque offers professional Ol Chiki support in families like October and November, developed from 2016 onward for high-legibility applications in headlines and extended reading, with rounded terminals for approachable signage.[35] Free options include the Uniol font, an open-source Unicode-compliant typeface available on GitHub, and Ol Chiki Biren, a TrueType font for desktop and mobile use in Santali writing.[36][37]
Design considerations for Ol Chiki fonts emphasize accurate rendering of its 30 letters and 6 diacritics, such as the relaa and phaarkaa marks, which require precise placement to maintain readability without combining characters.[33] Kerning adjustments are critical for elements like the mucaad punctuation, often refined through native speaker feedback to optimize spacing around diacritics and ensure natural flow.[33] Glyph spacing varies between print (upright, monoline forms) and simulations of the cursive Usaraà hand, which features slanted, narrower strokes and ligated diacritics like ahad for faster handwriting emulation.[33] Variable fonts, as in Noto Sans Ol Chiki, allow interpolation between weights to support stylistic flexibility without multiple static files.
Recent developments from 2020 to 2025 include open-source contributions on GitHub, such as the olchiki-fontfamily project, which advances a libre typeface with regular, bold, and italic variants commissioned for broader accessibility.[38] Typotheque's ongoing refinements, highlighted in 2023 research, incorporate community input for minority scripts like Ol Chiki, enhancing support in signage and digital interfaces.[27] These efforts build on the 2014–2016 CIS-A2K project, which produced an initial Unicode-compliant family under SIL OFL, including italic styles derived from handwritten samples.[33]
Prior to 2010, font coverage for Ol Chiki was severely limited, with few Unicode-compliant options and reliance on non-standard encodings that hindered digital adoption.[33] Typography challenges persist in line-breaking rules for Santali words, which follow Unicode's left-to-right horizontal algorithm with space-separated terms, but require custom handling in complex layouts to avoid awkward breaks around diacritics or punctuation.[4][39]
Rendered examples in Noto Sans Ol Chiki show crisp, even spacing for text like "ᱚᱞ ᱪᱤᱠᱤ" (Ol Chiki), with diacritics aligning seamlessly above baselines, while Typotheque's October renders the same phrase with softer curves for warmer, headline-appropriate aesthetics.[40] In free fonts like Uniol, the script appears monospaced and uniform, ideal for basic digital documents but less refined for professional kerning.[36]
Input methods and software support
Input methods for the Ol Chiki script, used primarily for the Santali language, include standardized keyboard layouts and phonetic transliteration tools that facilitate typing on various devices. The InScript keyboard layout, developed as the official Indian standard for Indic scripts, provides a direct mapping for Ol Chiki characters on QWERTY keyboards, allowing users to input the script's 30 letters, diacritics, and numerals through specific key combinations. Phonetic input methods, which transliterate Roman letters into Ol Chiki glyphs based on pronunciation, are widely available through tools like PramukhIME, enabling users to type English-like spellings that convert to the native script.[41] On mobile devices, apps such as the Ol Chiki Basic keyboard from Keyman offer phonetic and direct input options, supporting swipe gestures for efficient typing on Android and iOS platforms.[42]
Software support for Ol Chiki has expanded with Unicode integration, allowing input in major applications. Microsoft Windows provides native keyboard and font support for Ol Chiki starting from Windows 8.1, using the Nirmala UI font and keyboard identifier 000D0C00 for seamless text entry in programs like Microsoft Word and Notepad.[43] LibreOffice supports Ol Chiki through its Unicode compatibility and recent locale additions for Santali, with community-driven extensions enhancing spell-checking and hyphenation for the script.[44] Web-based editors, such as Google Docs, enable Ol Chiki input via browser extensions or integrated tools like Gboard's Santali keyboard, which handles Unicode rendering without additional setup.[45]
Recent developments from 2020 to 2025 have improved accessibility, particularly through AI-enhanced tools. Google Translate added official support for Ol Chiki in 2024, incorporating AI-driven translation that assists users in generating and verifying Santali text by translating from English or other languages, effectively aiding typing workflows.[46] Android keyboards like Olchiki Keyboard and Keyman now include swipe input for Ol Chiki, allowing fluid word formation by gliding over phonetic keys, while iOS support via Keyman extends to dictation features for voice-to-text conversion in the script.[47] These advances build on Unicode standardization to promote digital adoption among Santali speakers.
Despite progress, challenges persist in input ecosystems. Legacy systems using ASCII-based encodings for Ol Chiki require conversion tools to Unicode, as older software often fails to render or input the script correctly, leading to compatibility issues on pre-2010 platforms.[48] User training remains a barrier, with many learners needing manuals to master phonetic mappings or InScript layouts, though resources like Keyman's documentation help mitigate this.[49]
For example, using a phonetic method in tools like Lexilogos or PramukhIME, the Santali word "sermaa" (meaning "sky") is typed as follows: enter "s" for ᱥ, "e" for ᱮ, "r" for ᱨ, "m" for ᱢ, and "aa" for ᱟ, then apply space or conversion to yield ᱥᱮᱨᱢᱟ.[50] This approach simplifies entry for non-experts while preserving the script's phonetic accuracy.
Extensions and variations
Development of lowercase forms
The Ol Chiki script, originally designed as a unicameral system by Pandit Raghunath Murmu in 1925 without distinction between uppercase and lowercase forms, saw proposals for lowercase variants emerge in the 2010s to address perceived limitations in digital adaptability and stylistic flexibility. These extensions aimed to emulate the bicameral structure of the Latin script, facilitating mixed-language texts and enhancing readability in bilingual contexts involving Santali.[51][52]
Santali graphic designer, typographer, and film producer Sudip Iglesias Murmu led the development starting in 2017, conducting extensive research into the script's print (chapa) and cursive (Ol Usara) styles to create complementary lowercase forms. Murmu's work involved designing simplified, smaller versions of the 30 existing letters, resulting in 30 lowercase variants intended to sit alongside the existing letterforms in fonts for dual-case usage. These forms were arrived at after iterating through thousands of character designs, prioritizing aesthetic harmony with the script's natural-inspired shapes.[52][53]
The lowercase variants are not part of the original Ol Chiki design and remain non-standardized: the Unicode block (U+1C50–U+1C7F) encodes only the single-case letters, with no separate lowercase code points. As of 2025, they are primarily employed in experimental digital typography, casual online writing, and select print materials like children's educational resources or prototype Santali texts, such as font-based transliterations of religious works. This innovation promotes easier keyboarding on QWERTY layouts and improves visual flow in sentences, though adoption is limited due to the script's traditional unicameral emphasis on phonetic purity.[52][53]
Mixing print and cursive styles
The Ol Chiki script has two primary graphical styles: the print style (Chhapa hand) and the cursive style (Usaraà hand). While these styles are distinct and normally not mixed, the cursive style incorporates ligatures, such as those joining certain consonants with the AHAD modifier (ᱽ), to facilitate quicker handwriting. Zide (1996), as referenced in Unicode proposals, notes the efficiency of such AHAD ligatures in enhancing writing for Santali.[2] In digital contexts, the zero-width joiner (ZWJ) can be used to force specific ligatures, such as those involving AHAD, in fonts supporting handwriting forms.[2] Although the original inventor did not mention mixing Chhapa and Usaraà styles, limited experimental hybrid forms have appeared in some digital prototypes and artistic works, often simulated via variable fonts to interpolate between styles for educational or media purposes. However, inconsistent software rendering and lack of standardization pose challenges, and such mixing remains rare outside dedicated communities.[4]
References
- https://commons.wikimedia.org/wiki/File:Ol_chiki_cursive.png
