Recent from talks
Contribute something
Nothing was collected or created yet.
Bengali language
View on Wikipedia
| Bengali | |
|---|---|
| বাংলা (Bangla) | |
The word "Bangla" in the Bengali-Assamese script (Bengali alphabet) | |
| Pronunciation | [baŋˈla] ⓘ |
| Native to | Bangladesh and India |
| Region | |
| Ethnicity | Bengalis |
| Speakers | L1: 242 million (2011–2023)[1][2] L2: 43 million (2011–2023)[1] Total: 284 million (2011–2023)[1] |
Early forms | Magadhi Prakrit
|
| Dialects |
|
| Official status | |
Official language in |
|
| Regulated by |
|
| Language codes | |
| ISO 639-1 | bn |
| ISO 639-2 | ben |
| ISO 639-3 | ben |
| Glottolog | beng1280 |
| Linguasphere | 59-AAF-u |
Geographical distribution of the Bengali language. Darker shades imply a greater percentage of native speakers. | |
| Part of a series on |
| Bengalis |
|---|
Bengali,[a] also known by its endonym Bangla (বাংলা, Bāṅlā [baŋˈla] ⓘ), is a classical Indo-Aryan language belonging to the Indo-Iranian branch of the Indo-European language family. It is primarily spoken by the Bengali people, native to the Bengal region (Bangladesh, India's West Bengal and Tripura) of South Asia. With over 242 million (24.2 crore) native speakers and another 43 million (4.3 crore) as second language speakers as of 2025,[1] Bengali is the sixth most spoken native language and the seventh most spoken language by the total number of speakers in the world.[7][8]
Bengali is the official, national, and most widely spoken language of Bangladesh,[9][10][11] with 98% of Bangladeshis using Bengali as their first language.[12][13] It is the second-most widely spoken language in India. It is the official language of the Indian states of West Bengal, Tripura and the Barak Valley region of the state of Assam. It is also the second official language of the Indian state of Jharkhand since September 2011.[3] It is the most widely spoken language in the Andaman and Nicobar Islands in the Bay of Bengal,[14] and is spoken by significant populations in other states including Bihar, Arunachal Pradesh, Delhi, Chhattisgarh, Meghalaya, Mizoram, Nagaland, Odisha and Uttarakhand.[15] Bengali is also spoken by the Bengali diasporas (Bangladeshi diaspora and Indian Bengalis) across Europe, North America, the Middle East and other regions.[16]
Bengali was accorded the status of a classical language by the government of India on 3 October 2024.[17][18] It is the second most spoken and fifth fastest growing language in India, following Hindi, Kashmiri, Gujarati, and Meitei (Manipuri), according to the 2011 census of India.[19]
Bengali has developed over more than 1,400 years. Bengali literature, with its millennium-old literary history, was extensively developed during the Bengali Renaissance and is one of the most prolific and diverse literary traditions in Asia. The Bengali language movement from 1948 to 1956 demanding that Bengali be an official language of Pakistan fostered Bengali nationalism in East Bengal leading to the emergence of Bangladesh in 1971. In 1999, UNESCO recognised 21 February as International Mother Language Day in recognition of the language movement.[20][21]
History
[edit]

Ancient
[edit]With the advent of the Indo-Aryans in the 3rd century BCE, Bengal was gradually being Sanskritized.[23] The varieties of Prakrit spoken in Bengal region were generally referred to as "eastern Magadhi Prakrit", as coined by linguist Suniti Kumar Chatterji,[24] as the Middle Indo-Aryan dialects were influential in the first millennium when Bengal was a part of the Greater Magadhan realm. Muhammad Shahidullah attempted to trace the origin of Bengali to Old Indo-Aryan through the Gaudi Prakrit; his findings were later supported by A. B. Keith and others.[25]
The local varieties had no official status during the Gupta Empire, and with Bengal increasingly becoming a hub of Sanskrit literature for Hindu priests and Buddhist Acharyas, the vernacular of Bengal gained much influence from Sanskrit.[26][23] Magadhi Prakrit was also spoken in modern-day Bihar and Assam, and this vernacular eventually evolved into Ardha Magadhi.[27][28] Ardha Magadhi began to give way to what is known as Apabhraṃśa, by the end of the first millennium. The Bengali language evolved as a distinct language over the course of time.[29]
Early
[edit]A Sanskrit-Chinese dictionary compiled by the Chinese poet Li-Yen in 782 AD shows the presence of Bengali. A research document Classical Bangla published in 2024 by the Kolkata-based institute "Institute of Language Studies and Research" (ILSR), mentions the presence of 51 Bengali words in the dictionary. The lexicon strongly supports the existence of Old Bengali in the 8th century or earlier.[30][31]
Though some archaeologists claim that some 10th-century texts were in Bengali, it is not certain whether they represent a differentiated language or whether they represent a stage when Eastern Indo-Aryan languages were differentiating.[32] The local Apabhraṃśa of the eastern subcontinent, Purbi Apabhraṃśa or Abahatta (lit. 'meaningless sounds'), eventually evolved into regional dialects, which in turn formed three groups, the Bengali–Assamese languages, the Bihari languages, and the Odia language.
The language was not static: different varieties coexisted and authors often wrote in multiple dialects in this period. For example, Ardhamagadhi is believed to have evolved into Abahatta around the 6th century, which competed with the ancestor of Bengali for some time.[33][better source needed] The ancestor of Bengali was the language of the Pala Empire and the Sena dynasty.[34][35]
Medieval
[edit]
During the medieval period, Middle Bengali was characterised by the elision of the word-final অ ô and the spread of compound verbs, which originated from the Sanskrit Schwa. Slowly, the word-final ô disappeared from many words influenced by the Arabic, Persian, and Turkic languages.[citation needed] The arrival of merchants and traders from the Middle East and Turkestan into the Buddhist-ruling Pala Empire, from as early as the 7th century, gave birth to Islamic influence in the region.[citation needed]
In the 13th century, subsequent Arab Muslim and Turco-Persian expeditions to Bengal heavily influenced the local vernacular by settling among the native population.[36][37] Bengali absorbed Arabic and Persian influences in its vocabulary and dialect, including the development of Dobhashi.[36]
Bengali acquired prominence, over Persian, in the court of the Sultans of Bengal with the ascent of Jalaluddin Muhammad Shah.[38] Subsequent Muslim rulers actively promoted the literary development of Bengali,[39] allowing it to become the most spoken vernacular language in the Sultanate.[40] Bengali adopted many words from Arabic and Persian, which was a manifestation of Islamic culture on the language. Major texts of Middle Bengali (1400–1800) include Yusuf-Zulekha by Shah Muhammad Sagir and Srikrishna Kirtana by the Chandidas poets. Court support for Bengali culture and language waned when the Mughal Empire conquered Bengal in the late 16th and early 17th century.[41]
Modern
[edit]The standard literary form of Modern Bengali was developed during the 19th and early 20th centuries based on the west-central dialect spoken in Shantipur region of the Nadia district. Modern Bengali shows a high degree of diglossia, with the literary and standard form differing greatly from the colloquial speech of the regions that identify with the language.[42] Modern Bengali vocabulary is based on words inherited from Magadhi Prakrit and Pali, along with tatsamas and reborrowings from Sanskrit and borrowings from Persian, Arabic, Austroasiatic languages and other languages with which it has historically been in contact.
In the 19th and 20th centuries, there were two standard forms of written Bengali:
- চলিতভাষা Chôlitôbhasha, a colloquial form of Bengali using simplified inflections.
- সাধুভাষা Sadhubhasha, a formal and genteel form of Bengali.[43][44]
In 1948, the government of Pakistan tried to impose Urdu as the sole state language in Pakistan, giving rise to the Bengali language movement.[45] This was a popular ethnolinguistic movement in the former East Bengal (today Bangladesh), which arose as a result of the strong linguistic consciousness of the Bengalis and their desire to promote and protect spoken and written Bengali's recognition as a state language of the then Dominion of Pakistan. On 21 February 1952, five students and political activists were killed during protests near the campus of the University of Dhaka; they were the first ever martyrs to die for their right to speak their mother tongue. In 1956, Bengali was made a state language of Pakistan.[45] 21 February has since been observed as Language Movement Day in Bangladesh and has also been commemorated as International Mother Language Day by UNESCO every year since 2000.
In 2010, the parliament of Bangladesh and the legislative assembly of West Bengal proposed that Bengali be made an official UN language.[46] As of January 2023, no further action has been yet taken on this matter. However, in 2022, the UN did adopt Bangla as an unofficial language, after a resolution tabled by India.[47]
In 2024, the government of India conferred Bengali with the status of classical language.[17][18]
Geographical distribution
[edit]- Bangladesh (56.3%)
- India (42.0%)
- Other Countries (1.70%)
The Bengali language is native to the region of Bengal, which comprises the present-day nation of Bangladesh and the Indian state of West Bengal.

Besides the native region it is also spoken by the Bengalis living in Tripura, southern Assam and the Bengali population in the Indian union territory of Andaman and Nicobar Islands. Bengali is also spoken in the neighbouring states of Odisha, Bihar, and Jharkhand, and sizeable minorities of Bengali speakers reside in Indian cities outside Bengal, including Delhi, Mumbai, Thane, Varanasi, and Vrindavan. There are also significant Bengali-speaking communities in the Middle East,[48][49][50] the United States,[51] Singapore,[52] Malaysia, Australia, Canada, the United Kingdom, and Italy.
Official status
[edit]The 3rd article of the Constitution of Bangladesh states Bengali to be the sole official language of Bangladesh.[11] The Bengali Language Implementation Act, 1987, made it mandatory to use Bengali in all records and correspondences, laws, proceedings of court and other legal actions in all courts, government or semi-government offices, and autonomous institutions in Bangladesh.[9] It is also the de facto national language of the country.
In India, Bengali is one of the 23 official languages.[53] It is the official language of the Indian states of West Bengal, Tripura and in Barak Valley of Assam.[54][55] Bengali has been a second official language of the Indian state of Jharkhand since September 2011.
In Pakistan, Bengali is a recognised secondary language in the city of Karachi[56][57][58] mainly spoken by stranded Bengalis of Pakistan. The Department of Bengali in the University of Karachi (established by East Pakistani politicians before Independence of Bangladesh) also offers regular programs of studies at the Bachelors and at the Masters levels for Bengali Literature.[59]
The national anthems of both Bangladesh (Amar Sonar Bangla) and India (Jana Gana Mana) were written in Bengali by the Bengali Nobel laureate Rabindranath Tagore.[60] Notuner Gaan known as "Chol Chol Chol" is Bangladesh's national march, written by The National Poet Kazi Nazrul Islam in Bengali in 1928. It was adopted as the national marching song by the Bangladeshi government in 1972. Additionally, the first two verses of Vande Mataram, a patriotic song written in Bengali by Bankim Chandra Chatterjee, was adopted as the "national song" of India in both the colonial period and later in 1950 in independent India. Furthermore, it is believed by many that the national anthem of Sri Lanka (Sri Lanka Matha) was inspired by a Bengali poem written by Rabindranath Tagore,[61][62][63][64] while some even believe the anthem was originally written in Bengali and then translated into Sinhala.[65][66][67][68]
In 2009, elected representatives in both Bangladesh and West Bengal called for Bengali to be made an official language of the United Nations.[69]
Dialects
[edit]Regional varieties in spoken Bengali constitute a dialect continuum. Linguist Suniti Kumar Chatterji grouped the dialects of Bengali language into four large clusters: Rarhi, Vangiya, Kamrupi and Varendri;[70][71] but many alternative grouping schemes have also been proposed.[72] The West-Central dialects (Rarhi or Nadia dialect) form the basis of modern standard colloquial Bengali. In the dialects prevalent in much of eastern and south-eastern Bangladesh (Barisal, Chittagong, Dhaka and Sylhet Divisions of Bangladesh), many of the stops and affricates heard in West Bengal and western Bangladesh are pronounced as fricatives. Western alveolo-palatal affricates চ [tʃɔ], ছ [tʃʰɔ], জ [dʑɔ] correspond to eastern চ [sɔ], ছ [sɔ], জ [dzɔ~zɔ].
The influence of Tibeto-Burman languages on the phonology of Eastern Bengali is seen through the lack of nasalised vowels and an alveolar articulation of what are categorised as the "cerebral" consonants (as opposed to the postalveolar articulation of western Bengal). Some varieties of Bengali, particularly Sylheti,[73] Chittagonian and Chakma, have contrastive tone; differences in the pitch of the speaker's voice can distinguish words. Kharia Thar and Mal Paharia are closely related to Western Bengali dialects, but are typically classified as separate languages. Similarly, Hajong is considered a separate language, although it shares similarities to Northern Bengali dialects.[74]
During the standardisation of Bengali in the 19th century and early 20th century, the cultural centre of Bengal was in Kolkata, a city founded by the British. What is accepted as the standard form today in both West Bengal and Bangladesh is based on the West-Central dialect of Nadia and Kushtia District.[75] There are cases where speakers of Standard Bengali in West Bengal will use a different word from a speaker of Standard Bengali in Bangladesh, even though both words are of native Bengali descent. For example, the word salt is লবণ lôbôṇ in the east which corresponds to নুন nun in the west.[76]

Bengali exhibits diglossia, though some scholars have proposed triglossia or even n-glossia or heteroglossia between the written and spoken forms of the language.[42] Two styles of writing have emerged, involving somewhat different vocabularies and syntax:[75][77]
- Sadhu bhasha (সাধু ভাষা "upright language") was the written language, with longer verb inflections and more of a Pali and Sanskrit-derived Tatsama vocabulary. Songs such as India's national anthem Jana Gana Mana (by Rabindranath Tagore) were composed in this style. Its use in modern writing however is uncommon, restricted to some official signs and documents as well as for achieving particular literary effects.
- Chôlito bhasha (চলিত ভাষা "running language"), known by linguists as Standard Colloquial Bengali, is a written Bengali style exhibiting a preponderance of colloquial idiom and shortened verb forms and is the standard for written Bengali now. This form came into vogue towards the turn of the 19th century, promoted by the writings of Peary Chand Mitra (Alaler Gharer Dulal, 1857),[78] Pramatha Chaudhuri (Sabujpatra, 1914) and in the later writings of Rabindranath Tagore. It is modelled on the dialect spoken in the Shantipur and Shilaidaha region in Nadia and Kushtia Districts respectively. This form of Bengali is often referred to as the "Kushtia standard"(Bangladesh), "Nadia standard" (West Bengal), "West-Central dialect", "Shantipuri Bangla" or "Shilaidahi Bangla".[72]
Linguist Prabhat Ranjan Sarkar categorises the language as:
- Madhya Rarhi dialect
- Kanthi (Contai) dialect
- Kolkata dialect
- Shantipuriya (Nadia) dialect
- Shershahabadia (Maldahiya/ Jangipuri) dialect
- Barendri dialect
- Rangapuriya dialect
- Sylheti dialect
- Dhakiya (Bikrampuri) dialect
- Jashore/Jessoriya dialect
- Barisal (Chandradwip) dialect
- Chattal (Chittagong) dialect
While most writing is in Standard Colloquial Bengali (SCB), spoken dialects exhibit a greater variety. People in southeastern West Bengal, including Kolkata, speak in SCB. Other dialects, with minor variations from Standard Colloquial, are used in other parts of West Bengal and western Bangladesh, such as the Midnapore dialect, characterised by some unique words and constructions. However, a majority in Bangladesh speaks dialects notably different from SCB. Some dialects, particularly those of the Chittagong region, bear only a superficial resemblance to SCB.[79] The dialect in the Chittagong region is least widely understood by the general body of Bengalis.[79] The majority of Bengalis are able to communicate in more than one variety – often, speakers are fluent in Cholitobhasha (SCB) and one or more regional dialects.[43]
Even in SCB, the vocabulary may differ according to the speaker's religion: Muslims are more likely to use words of Persian and Arabic origin, along with more words naturally derived from Sanskrit (tadbhava), whereas Hindus are more likely to use tatsama (words directly borrowed from Sanskrit).[80] For example:[76]
| Predominantly Hindu usage | Origin | Predominantly Muslim usage | Origin | Translation |
|---|---|---|---|---|
| নমস্কার nômôskār | Directly borrowed from Sanskrit namaskāra | আসসালামু আলাইকুম āssālāmu ālāikum | Directly from Arabic as-salāmu ʿalaykum | hello |
| নিমন্ত্রণ nimôntrôṇ | Directly borrowed from Sanskrit nimantraṇa as opposed to the native Bengali nemôntônnô | দাওয়াত dāowāt | Borrowed from Arabic da`wah via Persian | invitation |
| জল jôl | Directly borrowed from Sanskrit jala | পানি pāni | Native, compare with Sanskrit pānīya | water |
| স্নান snān | Directly borrowed from Sanskrit snāna | গোসল gosôl | Borrowed from Arabic ghusl via Persian | bath |
| দিদি didi | Native, from Sanskrit devī | আপা āpā | From Turkic languages | sister / elder sister |
| দাদা dādā | Native, from Sanskrit dāyāda | ভাইয়া bhāiyā | Native, from Sanskrit bhrātā | brother / elder brother[81] |
| মাসী māsī | Native, from Sanskrit mātṛṣvasā | খালা khālā | Directly borrowed from Arabic khālah | maternal aunt |
| পিসী pisī | Native, from Sanskrit pitṛṣvasā | ফুফু phuphu | Native, from Prakrit phupphī | paternal aunt |
| কাকা kākā | From Persian or Dravidian kākā | চাচা chāchā | From Prakrit cācca | paternal uncle |
| প্রার্থনা prārthonā | Directly borrowed from Sanskrit prārthanā | দোয়া doyā | Borrowed from Arabic du`āʾ | prayer |
| প্রদীপ prôdīp | Directly borrowed from Sanskrit pradīp | বাতি bāti | Native, compare with Prakrit batti and Sanskrit barti | lamp |
| লঙ্কা lônkā | Native, named after Lanka | মরিচ môrich | Directly borrowed from Sanskrit marica | chilli |
Phonology
[edit]The phonemic inventory of standard Bengali consists of 29 consonants and 7 vowels, as well as 7 nasalised vowels. The inventory is set out below in the International Phonetic Alphabet (upper grapheme in each box) and romanisation (lower grapheme).
| Non-nasalised | Nasalised | |||||
|---|---|---|---|---|---|---|
| Front | Central | Back | Front | Central | Back | |
| Close | ই~ঈ i i |
উ~ঊ u u |
ইঁ~ঈঁ ĩ ĩ |
উঁ~ঊঁ ũ ũ | ||
| Close-mid | এ e e |
ও o o |
এঁ ẽ ẽ |
ওঁ õ õ | ||
| Open-mid | অ্যা æ æ |
অ ɔ ô |
অ্যাঁ æ̃ æ̃ |
অঁ ɔ̃ ɔ̃ | ||
| Open | আ a a |
আঁ ã ã |
||||
| Labial | Dental | Retroflex | Palato- alveolar |
Velar | Glottal | |||
|---|---|---|---|---|---|---|---|---|
| Nasal | m | n | ŋ | |||||
| Plosive/ Affricate |
voiceless | unaspirated | p | t̪ | ʈ | tʃ | k | |
| aspirated | pʰ | t̪ʰ | ʈʰ | tʃʰ | kʰ | |||
| voiced | unaspirated | b | d̪ | ɖ | dʒ | ɡ | ||
| aspirated | bʱ | d̪ʱ | ɖʱ | dʒʱ | ɡʱ | |||
| Fricative | voiceless | (ɸ) | s | ʃ | (h) | |||
| voiced | (β) | (z) | ɦ | |||||
| Approximant | (w) | l | (j) | |||||
| Rhotic | unaspirated | r | ɽ | |||||
| aspirated | (ɽʱ) | |||||||
Bengali is known for its wide variety of diphthongs, combinations of vowels occurring within the same syllable.[82] Two of these, /oi̯/ and /ou̯/, are the only ones with representation in script, as ঐ and ঔ respectively. /e̯ i̯ o̯ u̯/ may all form the glide part of a diphthong. The total number of diphthongs is not established, with bounds at 17 and 31. An incomplete chart is given by Sarkar (1985) of the following:[83]
| e̯ | i̯ | o̯ | u̯ | |
|---|---|---|---|---|
| a | ae̯ | ai̯ | ao̯ | au̯ |
| æ | æe̯ | æo̯ | ||
| e | ei̯ | eu̯ | ||
| i | ii̯ | iu̯ | ||
| o | oe̯ | oi̯ | oo̯ | ou̯ |
| u | ui̯ |
Stress
[edit]In standard Bengali, stress is predominantly initial. Bengali words are virtually all trochaic; the primary stress falls on the initial syllable of the word, while secondary stress often falls on all odd-numbered syllables thereafter, giving strings such as in সহযোগিতা shô-hô-jo-gi-ta "cooperation", where the boldface represents primary and secondary stress.
Consonant clusters
[edit]Native Bengali words do not allow initial consonant clusters;[84] the maximum syllabic structure is CVC (i.e., one vowel flanked by a consonant on each side). Many speakers of Bengali restrict their phonology to this pattern, even when using Sanskrit or English borrowings, such as গেরাম geram (CV.CVC) for গ্রাম gram (CCVC) "village" or ইস্কুল iskul (VC.CVC) for স্কুল skul (CCVC) "school".
Writing system
[edit]

The Bengali-Assamese script is an abugida, a script with letters for consonants, with diacritics for vowels, and in which an inherent vowel (অ ô) is assumed for consonants if no vowel is marked.[85] The Bengali alphabet is used throughout Bangladesh and eastern India (Assam, West Bengal, Tripura). The Bengali alphabet is believed to have evolved from a modified Brahmic script around 1000 CE (or 10th–11th century).[86] It is a cursive script with eleven graphemes or signs denoting nine vowels and two diphthongs, and thirty-nine graphemes representing consonants and other modifiers.[86] There are no distinct upper and lower case letter forms. The letters run from left to right and spaces are used to separate orthographic words. Bengali script has a distinctive horizontal line running along the tops of the graphemes that links them together called মাত্রা matra.[87]
Since the Bengali script is an abugida, its consonant graphemes usually do not represent phonetic segments, but carry an "inherent" vowel and thus are syllabic in nature. The inherent vowel is usually a back vowel, either [ɔ] as in মত [mɔt] "opinion" or [o], as in মন [mon] "mind", with variants like the more open [ɒ]. To emphatically represent a consonant sound without any inherent vowel attached to it, a special diacritic, called the hôsôntô (্), may be added below the basic consonant grapheme (as in ম্ [m]). This diacritic, however, is not common and is chiefly employed as a guide to pronunciation. The abugida nature of Bengali consonant graphemes is not consistent, however. Often, syllable-final consonant graphemes, though not marked by a hôsôntô, may carry no inherent vowel sound (as in the final ন in মন [mon] or the medial ম in গামলা [ɡamla]).
A consonant sound followed by some vowel sound other than the inherent [ɔ] is orthographically realised by using a variety of vowel allographs above, below, before, after, or around the consonant sign, thus forming the ubiquitous consonant-vowel typographic ligatures. These allographs, called কার kar, are diacritical vowel forms and cannot stand on their own. For example, the graph মি [mi] represents the consonant [m] followed by the vowel [i], where [i] is represented as the diacritical allograph ি (called ই-কার i-kar) and is placed before the default consonant sign. Similarly, the graphs মা [ma], মী [mi], মু [mu], মূ [mu], মৃ [mri], মে [me~mɛ], মৈ [moj], মো [mo] and মৌ [mow] represent the same consonant ম combined with seven other vowels and two diphthongs. In these consonant-vowel ligatures, the so-called "inherent" vowel [ɔ] is first expunged from the consonant before adding the vowel, but this intermediate expulsion of the inherent vowel is not indicated in any visual manner on the basic consonant sign ম [mɔ].
The vowel graphemes in Bengali can take two forms: the independent form found in the basic inventory of the script and the dependent, abridged, allograph form (as discussed above). To represent a vowel in isolation from any preceding or following consonant, the independent form of the vowel is used. For example, in মই [moj] "ladder" and in ইলিশ [iliʃ] "Hilsa fish", the independent form of the vowel ই is used (cf. the dependent formি). A vowel at the beginning of a word is always realised using its independent form.
In addition to the inherent-vowel-suppressing hôsôntô, three more diacritics are commonly used in Bengali. These are the superposed chôndrôbindu (ঁ), denoting a suprasegmental for nasalisation of vowels (as in চাঁদ [tʃãd] "moon"), the postposed ônusbar (ং) indicating the velar nasal [ŋ] (as in বাংলা [baŋla] "Bengali") and the postposed bisôrgô (ঃ) indicating the voiceless glottal fricative [h] (as in উঃ! [uh] "ouch!") or the gemination of the following consonant (as in দুঃখ [dukʰːɔ] "sorrow").
The Bengali consonant clusters (যুক্তব্যঞ্জন juktôbênjôn) are usually realised as ligatures, where the consonant which comes first is put on top of or to the left of the one that immediately follows. In these ligatures, the shapes of the constituent consonant signs are often contracted and sometimes even distorted beyond recognition. As in, ক্ষ (ক+ষ) or হ্ম (হ+ম) In the Bengali writing system, there are nearly 285 such ligatures denoting consonant clusters. Although there exist a few visual formulas to construct some of these ligatures, many of them have to be learned by rote. Recently, in a bid to lessen this burden on young learners, efforts have been made by educational institutions in the two main Bengali-speaking regions (West Bengal and Bangladesh) to address the opaque nature of many consonant clusters, and as a result, modern Bengali textbooks are beginning to contain more and more "transparent" graphical forms of consonant clusters, in which the constituent consonants of a cluster are readily apparent from the graphical form. However, since this change is not as widespread and is not being followed as uniformly in the rest of the Bengali printed literature, today's Bengali-learning children will possibly have to learn to recognise both the new "transparent" and the old "opaque" forms, which ultimately amounts to an increase in learning burden.
Bengali punctuation marks, apart from the downstroke । daṛi – the Bengali equivalent of a full stop – have been adopted from Western scripts and their usage is similar.[88]
Unlike in Western scripts (Latin, Cyrillic, etc.) where the letter forms stand on an invisible baseline, the Bengali letter-forms instead hang from a visible horizontal left-to-right headstroke called মাত্রা matra. The presence and absence of this matra can be important. For example, the letter ত tô and the numeral ৩ "3" are distinguishable only by the presence or absence of the matra, as is the case between the consonant cluster ত্র trô and the independent vowel এ e, also the letter হ hô and Bengali Ôbogroho ঽ (~ô) and letter ও o and consonant cluster ত্ত ttô. The letter-forms also employ the concepts of letter-width and letter-height (the vertical space between the visible matra and an invisible baseline).
There is yet to be a uniform standard collating sequence (sorting order of graphemes to be used in dictionaries, indices, computer sorting programs, etc.) of Bengali graphemes. Experts in both Bangladesh and India are currently working towards a common solution for this problem.
Alternative and historic scripts
[edit]
Throughout history, there have been instances of the Bengali language being written in different scripts, though these employments were never popular on a large scale and were communally limited. Owing to Bengal's geographic location, Bengali areas bordering non-Bengali regions have been influenced by each other. Small numbers of people in Midnapore, which borders Odisha, have used the Odia script to write in Bengali. In the border areas between West Bengal and Bihar, some Bengali communities historically wrote Bengali in Devanagari, Kaithi and Tirhuta.[89]
In Sylhet and Bankura, modified versions of the Kaithi script had some historical prominence, mainly among Muslim communities. The variant in Sylhet was identical to the Baitali Kaithi script of Hindustani with the exception of Sylhet Nagri possessing matra.[90] Sylhet Nagri was standardised for printing in c. 1869.[10]
Up until the 19th century, numerous variations of the Arabic script had been used across Bengal from Chittagong in the east to Meherpur in the west.[91][92][93] The 14th-century court scholar of Bengal, Nur Qutb Alam, composed Bengali poetry using the Persian alphabet.[94][95] After the Partition of India in the 20th century, the Pakistani government attempted to institute the Perso-Arabic script as the standard for Bengali in East Pakistan; this was met with resistance and contributed to the Bengali language movement.[96]
In the 16th century, Portuguese missionaries began a tradition of using the Roman alphabet to transcribe the Bengali language. Though the Portuguese standard did not receive much growth, a few Roman Bengali works relating to Christianity and Bengali grammar were printed as far as Lisbon in 1743. The Portuguese were followed by the English and French respectively, whose works were mostly related to Bengali grammar and transliteration. The first version of the Aesop's Fables in Bengali was printed using Roman letters based on English phonology by the Scottish linguist John Gilchrist. Consecutive attempts to establish a Roman Bengali have continued across every century since these times, and have been supported by the likes of Suniti Kumar Chatterji, Muhammad Qudrat-i-Khuda, and Muhammad Enamul Haq.[97] The Digital Revolution has also played a part in the adoption of the English alphabet to write Bengali,[98] with certain social media influencers publishing entire novels in Roman Bengali.[99]
Bengali script like others does have Schwa deletion. It does not mark when the inherent vowel is not used (mainly at the end of words)
Orthographic depth
[edit]The Bengali script in general has a comparatively shallow orthography when compared to the Latin script used for English and French, i.e., in many cases there is a one-to-one correspondence between the sounds (phonemes) and the letters (graphemes) of Bengali. But grapheme-phoneme inconsistencies do occur in many other cases. In fact, Bengali-Assamese script has the deepest orthography (deep orthography) among the Indian scripts. In general, the Bengali-Assamese script is fairly transparent for grapheme-to-phoneme conversion, i.e., it is easier to predict the pronunciation from spelling of the words. But the script is fairly opaque for phoneme-to-grapheme conversion, i.e., it is more difficult to predict the spelling from the pronunciation of the words.
One kind of inconsistency is due to the presence of several letters in the script for the same sound. In spite of some modifications in the 19th century, the Bengali spelling system continues to be based on the one used for Sanskrit,[88] and thus does not take into account some sound mergers that have occurred in the spoken language. For example, there are three letters (শ, ষ, and স) for the voiceless postalveolar fricative [ʃ], although the letter স retains the voiceless alveolar sibilant [s] sound when used in certain consonant conjuncts as in স্খলন [skʰɔlon] "fall", স্পন্দন [spɔndon] "beat", etc. The letter ষ also, sometimes, retains the voiceless retroflex sibilant [ʂ] sound when used in certain consonant conjuncts as in কষ্ট [kɔʂʈo] "suffering", গোষ্ঠী [ɡoʂʈʰi] "clan", etc. Similarly, there are two letters (জ and য) for the voiced postalveolar affricate [dʒ]. Moreover, what was once pronounced and written as a retroflex nasal ণ [ɳ] is now pronounced as an alveolar [n] when in conversation (the difference is heard when reading) (unless conjoined with another retroflex consonant such as ট, ঠ, ড and ঢ), although the spelling does not reflect this change. The near-open front unrounded vowel [æ] is orthographically realised by multiple means, as seen in the following examples: এত [æto] "so much", এ্যাকাডেমী [ækaɖemi] "academy", অ্যামিবা [æmiba] "amoeba", দেখা [dækʰa] "to see", ব্যস্ত [bæsto] "busy", ব্যাকরণ [bækorɔn] "grammar".
Another kind of inconsistency is concerned with the incomplete coverage of phonological information in the script. The inherent vowel attached to every consonant can be either [ɔ] or [o] depending on vowel harmony (স্বরসঙ্গতি) with the preceding or following vowel or on the context, but this phonological information is not captured by the script, creating ambiguity for the reader. Furthermore, the inherent vowel is often not pronounced at the end of a syllable, as in কম [kɔm] "less", but this omission is not generally reflected in the script, making it difficult for the new reader.
Many consonant clusters have different sounds than their constituent consonants. For example, the combination of the consonants ক্ [k] and ষ [ʂ] is graphically realised as ক্ষ and is pronounced [kkʰo] (as in রুক্ষ [rukkʰo] "coarse"), [kʰɔ] (as in ক্ষমতা [kʰɔmota] "capability") or even [kʰo] (as in ক্ষতি [kʰoti] "harm"), depending on the position of the cluster in a word. Another example is that there are around 7 or more graphemes to represent the sound [ʃ]. These are 'শ' as in শব্দ (śabda, pronounced as śôbdo "word"), 'ষ' as in ষড়যন্ত্র (ṣaṛayantra, pronounced as śoṛōjontrō "conspiracy"), 'স' as in সরকার (sarakāra, pronounced as śorkar "government"), 'শ্ব' as in শ্বশুর (written as śbaśura but pronounced with the ব b silent, i.e., as śōśur "father-in-law"), 'শ্ম' as in শ্মশান (written as śmaśāna but pronounced with the ম 'm' silent, i.e., as śośan "crematorium"), 'স্ব' as in স্বপ্ন (written as "sbapna" but pronounced with the ব 'b' silent, i.e., as śopnō "dream"), 'স্ম' as in স্মরণ (written as smaraṇa but pronounced with the ম 'm' silent, i.e., as śorōn "remembrance"), 'ষ্ম' as in গ্রীষ্ম (written as grīṣma but pronounced with the ম 'm' silent, i.e., as griśśō "summer") and so on. In most of the consonant clusters, only the first consonant is pronounced and rest of the consonants are silent. Examples are লক্ষ্মণ (written as lakṣmaṇa but pronounced as lokkhōn "Lakshman"), বিশ্বাস (written as biśbāsa but pronounced as biśśaś "belief"), বাধ্য (written as bādhya but pronounced as baddhō "obliged") and স্বাস্থ্য (written as sbāsthya but pronounced as śasthō "health"). Some consonant clusters have completely different pronunciation as compared to the constituent consonants. For example, 'হ্য' as in ঐতিহ্য (meaning "heritage") where hy is pronounced as jjh (written as aitihya but pronounced as ōitijjhō). The same হ্য is pronounced as 'hæ' as in হ্যাঁ (meaning "yes") (written as hyām̐ but pronounced as nasalised "hæ").
Another example of inconsistency in the script is that of words like, অন্য (written as anya but pronounced as ōnnō "other, different") and অন্ন (written as anna but pronounced as onnō "cooked rice, food"); in these words, the letter অ is combining with two different consonant clusters ন্য (nya) and ন্ন (nna), and while the same letter অ has two different pronunciations, ō and o, the two different consonant clusters have the same pronunciation. Thus, same letters and graphemes can often have different pronunciations depending on their position in a word and different graphemes and letters often have the same pronunciation.
The main reason for these numerous inconsistencies is that there have been lots of sound mergers in Bengali, but the script has failed to account for the sound shifts and consonant mergers in the language. Bengali has lots of tatsam words (words directly derived from Sanskrit) and in all these words, the original spelling has been preserved but the pronunciations have changed due to consonant mergers and sound shifts. In fact, most of the tatsam words have many grapheme-to-phoneme inconsistencies while most of the tadbhav words (native Bengali words) have fairly consistent grapheme-to-phoneme correspondence. The Bengali writing system is, therefore, not often a true guide to pronunciation.
Uses
[edit]The script used for Bengali, Assamese, and other languages is known as Bengali script. The script is known as the Bengali alphabet for Bengali and its dialects and the Assamese alphabet for Assamese language with some minor variations. Other related languages in the nearby region also make use of the Bengali script like the Meitei language in the Indian state of Manipur, where the Meitei language has been written in the Bengali script for centuries, though the Meitei script has been promoted in recent times.
Number system
[edit]Bengali digits are as follows:
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
|---|---|---|---|---|---|---|---|---|---|
| ০ | ১ | ২ | ৩ | ৪ | ৫ | ৬ | ৭ | ৮ | ৯ |
Some 19th-century grammars note additional signs for fractions, quarters and sixteenths in particular.[100][101]
Romanisation
[edit]There are various romanisation systems used for Bengali created in recent years which have failed to represent the true Bengali phonetic sound. The Bengali alphabet has often been included with the group of Brahmic scripts for romanisation where the true phonetic value of Bengali is never represented. Some of them are the International Alphabet of Sanskrit Transliteration, or IAST system (based on diacritics);[102] "Indian languages Transliteration", or ITRANS (uses upper case letters suited for ASCII keyboards);[103] and the National Library at Kolkata romanisation.[104]
In the context of Bengali romanisation, it is important to distinguish transliteration from transcription. Transliteration is orthographically accurate (i.e. the original spelling can be recovered), whereas transcription is phonetically accurate (the pronunciation can be reproduced). As the spelling often doesn't reflect the actual pronunciation, transliteration and transcription are often different.
Although it might be desirable to use a transliteration scheme where the original Bengali orthography is recoverable from the Latin text, Bengali words are currently romanised on Wikipedia using a phonemic transcription, where the true phonetic pronunciation of Bengali is represented with no reference to how it is written.
The most recent attempt has been by publishers Mitra and Ghosh with the launch of three popular children's books, Abol Tabol, Hasi Khusi and Sahoj Path, in Roman script at the Kolkata Book Fair 2018. Published under the imprint of Benglish Books, these are based on phonetic transliteration and closely follow spellings used in social media but for using an underline to describe soft consonants.
Grammar
[edit]Bengali nouns are not assigned gender, which leads to minimal changing of adjectives (inflection). However, nouns and pronouns are moderately declined (altered depending on their function in a sentence) into four cases while verbs are heavily conjugated, and the verbs do not change form depending on the number of the noun.
Word order
[edit]As a head-final language, Bengali follows a subject–object–verb word order, although variations on this theme are common.[105] Bengali makes use of postpositions, as opposed to the prepositions used in English and other European languages. Determiners follow the noun, while numerals, adjectives, and possessors precede the noun.[106]
Yes–no questions do not require any change to the basic word order; instead, the low (L) tone of the final syllable in the utterance is replaced with a falling (HL) tone. Additionally, optional particles (e.g. কি -ki, না -na, etc.) are often encliticised onto the first or last word of a yes–no question.
Wh-questions are formed by fronting the wh-word to focus position, which is typically the first or second word in the utterance.
Nouns
[edit]Nouns and pronouns are inflected for case, including nominative, objective, genitive (possessive), and locative.[29] The case marking pattern for each noun being inflected depends on the noun's degree of animacy. When a definite article such as -টা -ṭa (singular) or -গুলো -gulo (plural) is added, as in the tables below, nouns are also inflected for number.
In most of Bengali grammar books, cases are divided into 6 categories and an additional possessive case (the possessive form is not recognised as a type of case by Bengali grammarians). But in terms of usage, cases are generally grouped into only 4 categories.
| Animate | Inanimate | |||
|---|---|---|---|---|
| Singular | Plural | Singular | Plural | |
| Nominative | ছাত্রটি chatrô-ṭi the student |
ছাত্ররা chatrô-ra /
ছাত্রগণ
the students |
জুতোটা juto-ṭa the shoe |
জুতাগুলা juta-gula / / জুতোগুলো juto-gulo the shoes |
| Objective | ছাত্রটিকে chatrô-ṭi-ke the student |
ছাত্রদের(কে) chatrô-der(ke) the students |
জুতোটা juto-ṭa the shoe |
জুতাগুলা juta-gula / / জুতোগুলো juto-gulo the shoes |
| Genitive | ছাত্রটির chatrô-ṭi-r the student's |
ছাত্রদের chatrô-der the students' |
জুতোটার juto-ṭa-r the shoe's |
জুতাগুলা juta-gula / / জুতোগুলোর juto-gulo-r the shoes' |
| Locative | – | – | জুতোটায় juto-ṭa-y on/in the shoe |
জুতাগুলা juta-gula / / জুতোগুলোতে juto-gulo-te on/in the shoes |
When counted, nouns take one of a small set of measure words. Nouns in Bengali cannot be counted by adding the numeral directly adjacent to the noun. An appropriate measure word (MW), a classifier, must be used between the numeral and the noun (most languages of the Mainland Southeast Asia linguistic area are similar in this respect). Most nouns take the generic measure word -টা -ṭa, though other measure words indicate semantic classes (e.g. -জন -jôn for humans). There is also the classifier -khana, and its diminutive form -khani, which attaches only to nouns denoting something flat, long, square, or thin. These are the least common of the classifiers.[107]
নয়টা Nôy-ṭa nine-MW গরু goru cow Nine cows |
কয়টা Kôy-ṭa how many-MW বালিশ balish pillow How many pillows |
অনেকজন Ônek-jôn many-MW লোক lok person Many people |
চার-পাঁচজন Ĉar-pãc-jôn four-five-MW শিক্ষক shikkhôk teacher Four to five teachers |
Measuring nouns in Bengali without their corresponding measure words (e.g. আট বিড়াল aṭ biṛal instead of আটটা বিড়াল aṭ-ṭa biṛal "eight cats") would typically be considered ungrammatical. However, when the semantic class of the noun is understood from the measure word, the noun is often omitted and only the measure word is used, e.g. শুধু একজন থাকবে। Shudhu êk-jôn thakbe. (lit. 'Only one-'MW will remain.") would be understood to mean "Only one person will remain.", given the semantic class implicit in -জন -jôn.
In this sense, all nouns in Bengali, unlike most other Indo-European languages, are similar to mass nouns.
Verbs
[edit]There are two classes of verbs: finite and non-finite. Non-finite verbs have no inflection for tense or person, while finite verbs are fully inflected for person (first, second, third), tense (present, past, future), aspect (simple, perfect, progressive), and honour (intimate, familiar, and formal), but not for number. Conditional, imperative, and other special inflections for mood can replace the tense and aspect suffixes. The number of inflections on many verb roots can total more than 200.
Inflectional suffixes in the morphology of Bengali vary from region to region, along with minor differences in syntax.
Bengali differs from most Indo-Aryan Languages in the zero copula, where the copula or connective be is often missing in the present tense.[88] Thus, "he is a teacher" is তিনি শিক্ষক se shikkhôk, (literally "he teacher").[108] In this respect, Bengali is similar to Russian and Hungarian. Romani grammar is also the closest to Bengali grammar.[109]
Vocabulary
[edit]- Tadbhavas in Bengali (Inherited Indo-Aryan vocabulary) (16.0%)
- Tatsamas in Bengali (Direct borrowings from Sanskrit) (40.0%)
- Native Words (Indigenous, "Desi" words) (16.0%)
- Foreign Loanwords (words originating from Persian, Turkish, Arabic, English, Portuguese, etc.) (28.0%)
Bengali is typically thought to have around 100,000 separate words, of which 16,000 (16%) are considered to be তদ্ভব tôdbhôbô, or Tadbhava (inherited Indo-Aryan vocabulary), 40,000 (40%) are তৎসম tôtśômô or Tatsama (words directly borrowed from Sanskrit), and borrowings from দেশী deśi, or "indigenous" words, which are at around 16,000 (16%) of the Bengali vocabulary. The rest are বিদেশী bideśi or "foreign" sources, including Persian, Turkish, Arabic, and English among others, accounting for around 28,000 (28%) of all Bengali words, highlighting the significant influence that foreign languages and cultures have had on the Bengali language throughout Bengal's long history of contact with different peoples and the cultural exchanges that came with such interactions.[110] Bengali is reportedly similar to Assamese and has a lexical similarity of 40 per cent with Nepali.[111]
According to Suniti Kumar Chatterji, dictionaries from the early 20th century attributed a little more than 50% of the Bengali vocabulary to native words (i.e., naturally modified Sanskrit words, corrupted forms of Sanskrit words, and loanwords non-Indo-European languages). About 45% per cent of Bengali words are unmodified Sanskrit, and the remaining words are from foreign languages.[112] However, more modern sources cite that this is not the case with Bengali vocabulary, as there are far more dominant foreign influences that accurately reflect the way modern Bengalis speak and utilise Bengali.[113] Persian is also thought to have influenced many grammatical forms.[114] More recent studies suggest that the use of foreign words has been increasing, mainly because of the preference of Bengali speakers for the colloquial style.[112] Because of centuries of contact with Europeans, Turkic peoples, and Persians, Bengali has absorbed numerous words from foreign languages, often totally integrating these borrowings into the core vocabulary.
Persian influence was significant for the development of Bengali up to the modern day, and was the primary official language in the region for 600 years, until British rule, when it was changed to English in 1836. In fact, there was so much Persian influence that a register of highly Persianized Bengali, known as Dobhashi appeared in medieval Bengal.[115]
The most common borrowings from foreign languages come from three types of contact. After close contact with several indigenous Austroasiatic languages,[116][117][118][119] and later the Delhi Sultanate, the Bengal Sultanate, and the Mughal Empire, whose court language was Persian, numerous Arabic, Persian, and Chaghatai words were absorbed into the lexicon.[45]
Later, East Asian travellers and lately European colonialism brought words from Portuguese, French, Dutch, and most significantly English during the colonial period.[citation needed]
Sample text
[edit]The following is a sample text in Bengali of Article 1 of the Universal Declaration of Human Rights:
সমস্ত
Sômôstô
ʃɔmosto
All
মানুষ
manush
manuʃ
human
স্বাধীনভাবে
shadhinbhabe
ʃadʱinbʱabe
free-manner-in
সমান
sôman
ʃoman
equal
মর্যাদা
môrjada
mɔɾdʒada
dignity
এবং
ebông
eboŋ
and
অধিকার
ôdhikar
odʱikaɾ
right
নিয়ে
niye
nie̯e
taken
জন্মগ্রহণ
jônmôgrôhôn
dʒɔnmoɡrohon
birth-take
করে।
kôre.
kɔɾe
do.
তাঁদের
Tãder
tãdeɾ
Their
বিবেক
bibek
bibek
reason
এবং
ebông
eboŋ
and
বুদ্ধি
buddhi
budʱːi
intelligence
আছে;
achhe;
atʃʰe
exist;
সুতরাং
sutôrang
ʃutoraŋ
therefore
সকলেরই
sôkôleri
ʃɔkoleɾi
everyone-indeed
একে
êke
ɛke
one
অপরের
ôpôrer
ɔporeɾ
another's
প্রতি
prôti
proti
towards
ভ্রাতৃত্বসুলভ
bhratrittôsulôbh
bʱratritːoʃulɔbʱ
brotherhood-ly
মনোভাব
mônobhab
monobʱab
attitude
নিয়ে
niye
nie̯e
taken
আচরণ
achôrôn
atʃorɔn
conduct
করা
kôra
kɔra
do
উচিত।
uchit.
utʃit
should.
All human beings are born free and equal in dignity and rights. They possess conscience and reason. Therefore, everyone should act in a spirit of brotherhood towards each other.
See also
[edit]Notes
[edit]- ^ a b c d e Bengali at Ethnologue (28th ed., 2025)
- ^ "Scheduled Languages in descending order of speaker's strength – 2011" (PDF). Registrar General and Census Commissioner of India. Archived (PDF) from the original on 14 November 2018. Retrieved 28 June 2018.
- ^ a b "Jharkhand gives second language status to Magahi, Angika, Bhojpuri, and Maithili". The Avenue Mail. 21 March 2018. Archived from the original on 28 March 2019. Retrieved 30 April 2019.
- ^ "Bengali". Dictionary.com. Retrieved 7 February 2024.
An Indic language spoken in India and Bangladesh.
- ^ "Bengal". The Chambers Dictionary (9th ed.). Chambers. 2003. ISBN 0-550-10105-5.
- ^ Laurie Bauer, 2007, The Linguistics Student's Handbook, Edinburgh
- ^ "The World Factbook". Central Intelligence Agency. Archived from the original on 26 January 2021. Retrieved 21 February 2018.
- ^ "Summary by language size". Ethnologue. 2019. Archived from the original on 24 April 2019. Retrieved 21 February 2019.
- ^ a b "Bangla Bhasha Procholon Ain, 1987" বাংলা ভাষা প্রচলন আইন, ১৯৮৭ [Bengali Language Implementation Act, 1987] (PDF). Bangladesh Code বাংলাদেশ কোড (in Bengali). Vol. 27 (Online ed.). Dhaka: Ministry of Law, Justice and Parliamentary Affairs, Bangladesh. Archived from the original (PDF) on 5 July 2016. Retrieved 23 May 2019.
- ^ a b "Bangla Language". Banglapedia. Archived from the original on 6 July 2015. Retrieved 12 September 2018.
- ^ a b "The Constitution of the People's Republic of Bangladesh". Ministry of Law, Justice and Parliamentary Affairs. Archived from the original on 10 November 2019. Retrieved 23 May 2019.
- ^ "National Languages Of Bangladesh". einfon.com. 11 June 2017. Archived from the original on 2 August 2017. Retrieved 12 September 2018.
- ^ "5 Surprising Reasons the Bengali Language Is Important". 17 August 2017. Archived from the original on 26 June 2018. Retrieved 10 March 2018.
- ^ "50th Report of the Commissioner for Linguistic Minorities in India (July 2012 to June 2013)" (PDF). National Commission for Linguistic Minorities. 16 July 2014. Archived from the original (PDF) on 2 January 2018. Retrieved 20 February 2018.
- ^ "50th Report of the Commissioner for Linguistic Minorities in India" (PDF). nclm.nic.in. Ministry of Minority Affairs. Archived from the original (PDF) on 8 July 2016. Retrieved 23 May 2019.
- ^ "Bengali Language". www.britannica.com. Archived from the original on 26 June 2018. Retrieved 12 September 2018.
- ^ a b Kumar, Vivek; Roy, Suryagni (3 October 2024). "Marathi, Pali, Prakrit, Assamese, Bengali now among classical languages". India Today. Retrieved 3 October 2024.
- ^ a b "Classical language status: Union Cabinet approves addition of Marathi, Assamese, Bengali, Pali, Prakrit to list". The Hindu. PTI. 3 October 2024. ISSN 0971-751X. Retrieved 3 October 2024.
- ^ —R, Aishwaryaa (6 June 2019). "What census data reveals about use of Indian languages". Deccan Herald. Archived from the original on 16 November 2023. Retrieved 16 November 2023.
—Pallapothu, Sravan (28 June 2018). "Hindi Added 100Mn Speakers In A Decade; Kashmiri 2nd Fast Growing Language". Indiaspend.com. Archived from the original on 16 November 2023. Retrieved 16 November 2023.
—IndiaSpend (2 July 2018). "Hindi fastest growing language in India, finds 100 million new speakers". Business Standard. Archived from the original on 16 November 2023. Retrieved 16 November 2023.
—Mishra, Mayank; Aggarwal, Piyush (11 April 2022). "Hindi grew rapidly in non-Hindi states even without official mandate". India Today. Archived from the original on 16 November 2023. Retrieved 16 November 2023. - ^ "Amendment to the Draft Programme and Budget for 2000–2001 (30 C/5)" (PDF). General Conference, 30th Session, Draft Resolution. UNESCO. 1999. Archived (PDF) from the original on 21 May 2011. Retrieved 27 May 2008.
- ^ "Resolution adopted by the 30th Session of UNESCO's General Conference (1999)". International Mother Language Day. UNESCO. Archived from the original on 1 June 2008. Retrieved 27 May 2008.
- ^ (Toulmin 2009:220)
- ^ a b Datta, Amaresh (1988). Encyclopaedia of Indian Literature: Devraj to Jyoti. Sahitya Akademi. p. 1694. ISBN 978-81-260-1194-0. Archived from the original on 2 November 2023. Retrieved 29 October 2023.
- ^ Tuteja, K. L.; Chakraborty, Kaustav (15 March 2017). Tagore and Nationalism. Springer. p. 59. ISBN 978-81-322-3696-2. Archived from the original on 2 November 2023. Retrieved 29 October 2023.
- ^ Emeneau, Murray B.; Fergusson, Charles A. (21 November 2016). Linguistics in South Asia. Walter de Gruyter GmbH & Co KG. p. 680. ISBN 978-3-11-081950-2.
In 'Magadhi Prakrit and Bengali' (1925) he attempted to trace the origin of Bengali to Old Indo-Aryan through the Gaudi Prakrit. In this study Shahidullah disagreed with Grierson, Hoernle, Chatterji, and other authorities. Shahidullah's findings were later supported by A. B. Keith and others.
- ^ Shariful Islam (2012). "Bangla Script". In Sirajul Islam; Miah, Sajahan; Khanam, Mahfuza; Ahmed, Sabbir (eds.). Banglapedia: the National Encyclopedia of Bangladesh (Online ed.). Dhaka, Bangladesh: Banglapedia Trust, Asiatic Society of Bangladesh. ISBN 984-32-0576-6. OCLC 52727562. OL 30677644M. Retrieved 3 November 2025.
- ^ Shah 1998, p. 11
- ^ Keith 1998, p. 187
- ^ a b (Bhattacharya 2000)
- ^ Mitra, Atri (8 October 2024). "Behind Bengali's classical language tag, a Kolkata institute's 2,000-page research document". The Indian Express. Kolkata. Retrieved 24 February 2025.
And the West Bengal government-run organisation that prepared the document, the Institute of Language Studies and Research (ILSR), is delighted its more than a year-long efforts have paid off...."…a Chinese-Sanskrit Dictionary compiled by Li-Yen bears testimony to the fact that at least 51 Bangla words made their way into that dictionary…The Chinese-Sanskrit dictionary, compiled in the 8th Century CE, included or rather was compelled to include words of a third language, i.e., Bangla. This is indicative of the fact that Bangla had already been standardized and was one of the influential tongues which was regarded as the lingua franca of the geographic area that was known as Bengal,"....
- ^ Das, Amitava; Sanyal, Rajat; Chakraborty, Rajib, eds. (2025). Classical Bangla. Kolkata: Institute of Language Studies and Research (ILSR), Kolkata. pp. 332―337. ISBN 9788198479358.
The Chinese-Sanskrit dictionary, compiled in the 8th Century CE, included or rather was compelled to include words of a third language, i.e., Bangla. This is indicative of the fact that Bangla had already been standardized and was one of the influential tongues which was regarded as the lingua franca of the geographic area that was known as Bengal. This illustrates with absolute certainty that Bangla must have been the language of communication for several centuries before the time of composition of Carya songs. By the time Carya songs were composed, Bangla had already acquired prosodic as well as rhetoric sophistication to be considered as a medium of literary discourse.
- ^ "Within the Eastern Indic language family the history of the separation of Bangla from Odia, Assamese, and the languages of Bihar remains to be worked out carefully. Scholars do not yet agree on criteria for deciding if certain tenth century AD texts were in a Bangla already distinguishable from the other languages, or marked a stage at which Eastern Indic had not finished differentiating." (Dasgupta 2003:386–387)
- ^ "Banglapedia". Banglapedia. Archived from the original on 15 May 2020. Retrieved 7 November 2017.
- ^ "Pala dynasty – Indian dynasty". Global.britannica.com. Archived from the original on 5 March 2017. Retrieved 7 November 2017.
- ^ nimmi. "Pala Dynasty, Pala Empire, Pala empire in India, Pala School of Sculptures". Indianmirror.com. Archived from the original on 28 October 2017. Retrieved 7 November 2017.
- ^ a b Bari, Sarah Anjum (12 April 2019). "A Tale of Two Languages: How the Persian language seeped into Bengali". The Daily Star. Retrieved 5 March 2024.
- ^ Majumder, Atindra. Arabic, Persian and Turkish words in Bengali Literature.
- ^ "What is more significant, a contemporary Chinese traveler reported that although Persian was understood by some in the court, the language in universal use there was Bengali. This points to the waning, although certainly not yet the disappearance, of the sort of foreign mentality that the Muslim ruling class in Bengal had exhibited since its arrival over two centuries earlier. It also points to the survival, and now the triumph, of local Bengali culture at the highest level of official society." (Eaton 1993:60)
- ^ Rabbani, AKM Golam (7 November 2017). "Politics and Literary Activities in the Bengali Language during the Independent Sultanate of Bengal". Dhaka University Journal of Linguistics. 1 (1): 151–166. Archived from the original on 11 October 2017. Retrieved 7 November 2017 – via www.banglajol.info.
- ^ Eaton 1993.
- ^ (Eaton 1993:167–174)
- ^ a b "Bengali Language at Cornell". Department of Asian Studies. Cornell University. Archived from the original on 15 November 2012.
- ^ a b Ray, S Kumar. "The Bengali Language and Translation". Translation Articles. Kwintessential. Archived from the original on 25 September 2006. Retrieved 19 November 2006.
- ^ "Bengali language | History, Writing System & Dialects | Britannica". www.britannica.com. 3 March 2024. Retrieved 7 March 2024.
- ^ a b c Thompson, Hanne-Ruth (2012). Bengali (Paperback with corrections. ed.). Amsterdam: John Benjamins Pub. Co. p. 3. ISBN 978-90-272-3819-1.
- ^ "Bengali 'should be UN language'". BBC News. 22 December 2009. Archived from the original on 8 November 2017. Retrieved 7 November 2017.
- ^ "UN adopts Bangla as unofficial language". Dhaka Tribune. 12 June 2022. Archived from the original on 20 January 2023. Retrieved 20 January 2023.
- ^ "Kuwait restricts recruitment of male Bangladeshi workers". Dhaka Tribune. 7 September 2016. Archived from the original on 30 August 2017. Retrieved 4 December 2017.
- ^ "Bahrain: Foreign population by country of citizenship, sex and migration status (worker/ family dependent) (selected countries, January 2015) – GLMM". GLMM. 20 October 2015. Archived from the original on 16 December 2017. Retrieved 4 December 2017.
- ^ "Saudi Arabia". Ethnologue. Archived from the original on 23 November 2017. Retrieved 4 December 2017.
- ^ "New York State Voter Registration Form" (PDF). Archived (PDF) from the original on 27 January 2018. Retrieved 10 February 2018.
- ^ "Bangla Language and Literary Society, Singapore". Archived from the original on 5 May 2013.
- ^ "Languages of India". Ethnologue Report. Archived from the original on 26 December 2009. Retrieved 4 November 2006.
- ^ "Language". Government of Assam. Archived from the original on 26 August 2006. Retrieved 20 June 2006.
- ^ Bhattacharjee, Kishalay (30 April 2008). "It's Indian language vs Indian language". NDTV.com. Archived from the original on 23 July 2013. Retrieved 27 May 2008.
- ^ Syed Yasir Kazmi (16 October 2009). "Pakistani Bengalis". DEMOTIX. Archived from the original on 2 January 2015. Retrieved 2 April 2013.
- ^ "کراچی کے 'بنگالی پاکستانی'(Urdu)". محمد عثمان جامعی. 17 November 2003. Archived from the original on 19 November 2003. Retrieved 2 April 2013.
- ^ Rafiqul Islam. "The Language Movement : An Outline". Archived from the original on 19 November 2012. Retrieved 12 May 2016.
- ^ "Karachi Department of Bengali". Archived from the original on 10 May 2013. Retrieved 2 April 2013.
- ^ "Statement by Foreign Minister on Second Bangladesh-India Track II dialogue at BRAC Centre on 07 August, 2005". Ministry of Foreign Affairs, Government of Bangladesh. Archived from the original on 18 April 2008. Retrieved 27 May 2008.
- ^ "Sri Lanka". The World Factbook. Archived from the original on 17 December 2021. Retrieved 20 September 2017.
- ^ "Man of the series: Nobel laureate Tagore". The Times of India. Times News Network. 3 April 2011. Archived from the original on 12 June 2018. Retrieved 20 June 2019.
- ^ "Sri Lanka I-Day to have anthem in Tamil". The Hindu. 4 February 2016. Archived from the original on 4 February 2016. Retrieved 20 September 2017.
- ^ "Tagore's influence on Lankan culture". Hindustan Times. 12 May 2010. Archived from the original on 11 October 2017. Retrieved 20 September 2017.
- ^ Wickramasinghe, Nira (2003). Dressing the Colonised Body: Politics, Clothing, and Identity in Sri Lanka. Orient Longman. p. 26. ISBN 978-81-250-2479-8. Archived from the original on 29 September 2018. Retrieved 29 September 2018.
- ^ Wickramasinghe, Kamanthi; Perera, Yoshitha. "Sri Lankan National Anthem: can it be used to narrow the gap?". The Daily Mirror (Sri Lanka). No. 30 March 2015. Archived from the original on 11 October 2017. Retrieved 20 September 2017.
- ^ Haque, Junaidul (7 May 2011). "Rabindranath: He belonged to the world". The Daily Star (Bangladesh). Archived from the original on 4 March 2016. Retrieved 20 September 2017.
- ^ Habib, Haroon (17 May 2011). "Celebrating Rabindranath Tagore's legacy". The Hindu. Archived from the original on 13 June 2014. Retrieved 20 September 2017.
- ^ Bhaumik, Subir (22 December 2009). "Bengali 'should be UN language'". BBC News. Archived from the original on 3 April 2012. Retrieved 25 January 2010.
- ^ name="huq_sarkar"
- ^ The Origin and Development of the Bengali language, Suniti kumar Chatterjee, Vol- 1, Page 140, George Allen and Unwin London,New Edition,1970.
- ^ a b Morshed, Abul Kalam Manjoor (2012). "Dialect". In Sirajul Islam; Miah, Sajahan; Khanam, Mahfuza; Ahmed, Sabbir (eds.). Banglapedia: the National Encyclopedia of Bangladesh (Online ed.). Dhaka, Bangladesh: Banglapedia Trust, Asiatic Society of Bangladesh. ISBN 984-32-0576-6. OCLC 52727562. OL 30677644M. Retrieved 3 November 2025.
- ^ Gope, Amalesh; Mahanta, Sakuntala (2014). Lexical Tones in Sylheti (PDF). 4th International Symposium on Tonal Aspects of Languages (TAL-2014). Archived from the original (PDF) on 15 August 2021. Retrieved 27 June 2020.
- ^ "Hajong". The Ethnologue Report. Archived from the original on 15 November 2006. Retrieved 19 November 2006.
- ^ a b Bhasa, Chalita; Huq, Mohammad Daniul (2012). "Bangladesh". In Sirajul Islam; Miah, Sajahan; Khanam, Mahfuza; Ahmed, Sabbir (eds.). Banglapedia: the National Encyclopedia of Bangladesh (Online ed.). Dhaka, Bangladesh: Banglapedia Trust, Asiatic Society of Bangladesh. ISBN 984-32-0576-6. OCLC 52727562. OL 30677644M. Retrieved 3 November 2025.
- ^ a b "History of Bengali (Banglar itihash)". Bengal Telecommunication and Electric Company. Archived from the original on 7 July 2011. Retrieved 20 November 2006.
- ^ Huq, Mohammad Daniul (2012). "Sadhu Bhasa". In Sirajul Islam; Miah, Sajahan; Khanam, Mahfuza; Ahmed, Sabbir (eds.). Banglapedia: the National Encyclopedia of Bangladesh (Online ed.). Dhaka, Bangladesh: Banglapedia Trust, Asiatic Society of Bangladesh. ISBN 984-32-0576-6. OCLC 52727562. OL 30677644M. Retrieved 3 November 2025.
- ^ Huq, Mohammad Daniul (2012). "Alaler Gharer Dulal". In Sirajul Islam; Miah, Sajahan; Khanam, Mahfuza; Ahmed, Sabbir (eds.). Banglapedia: the National Encyclopedia of Bangladesh (Online ed.). Dhaka, Bangladesh: Banglapedia Trust, Asiatic Society of Bangladesh. ISBN 984-32-0576-6. OCLC 52727562. OL 30677644M. Retrieved 3 November 2025.
- ^ a b Ray, Hai & Ray 1966, p. 89
- ^ Ray, Hai & Ray 1966, p. 80
- ^ "A Bilingual Dictionary of Words and Phrases (English-Bengali)". Bengali-dictionary.com. Archived from the original on 25 October 2016. Retrieved 25 October 2016.
- ^ (Masica 1991, pp. 116)
- ^ Sarkar, Pabitra (1985). Bangla diswar dhoni. Bhasa.
{{cite book}}: CS1 maint: location missing publisher (link) - ^ (Masica 1991, pp. 125)
- ^ Escudero Pascual Alberto (23 October 2005). "Writing Systems/ Scripts" (PDF). Primer to Localization of Software. it46.se. Archived from the original (PDF) on 19 March 2009. Retrieved 20 November 2006.
- ^ a b "Bangalah". Archived from the original on 5 July 2015. in Asiatic Society of Bangladesh 2003
- ^ "banglasemantics.net". Archived from the original on 24 December 2010.
- ^ a b c "Bangla language". Archived from the original on 6 July 2015. in Asiatic Society of Bangladesh 2003
- ^ Chatterji (1926), p. 234–235.
- ^ Saha, RN (1935). "The Origin of the Alphabet and Numbers". In Khattry, DP (ed.). Report of All Asia Educational Conference (Benares, December 26–30, 1930). Allahabad, India: The Indian Press Ltd. pp. 751–779.
- ^ Chatterji (1926), pp. 228–233.
- ^ Khan Sahib, Maulavi Abdul Wali (2 November 1925). A Bengali Book written in Persian Script.
- ^ Ahmad, Qeyamuddin (20 March 2020). The Wahhabi Movement in India. Routledge.
- ^ "The development of Bengali literature during Muslim rule" (PDF). Blogs.edgehill.ac.uk. Archived from the original (PDF) on 9 August 2017. Retrieved 7 November 2016.
- ^ Shahidullah, Muhammad (February 1963). "হযরত নূরুদ্দীন নূরুল হক নূর কুতবুল আলম (রহঃ)". Islam Prasanga ed. 1st ইসলাম প্রসঙ্গ (in Bengali) (1 ed.). Dacca: Mawla Brothers. p. 99.
- ^ Kurzon, Dennis (2010). "Romanisation of Bengali and Other Indian Scripts". Journal of the Royal Asiatic Society. 20 (1): 71–73. ISSN 1356-1863. JSTOR 27756124. Archived from the original on 23 August 2021. Retrieved 23 August 2021.
- ^ Chatterji (1926), pp. 233–234.
- ^ Kurzon, Dennis (2009). Romanisation of Bengali and Other Indian Scripts (Thesis). Cambridge University.
- ^ Islam, Tahsina (18 September 2019). "The question of standard Bangla". The Independent. Dhaka. Archived from the original on 12 May 2021. Retrieved 12 May 2021.
- ^ Yates, William (1847). Wenger, John (ed.). A Bengali Grammar. Vol. I. Calcutta: Baptist Mission Press. p. 120.
- ^ Beames, John (1891). Grammar of the Bengali language: Literary and Colloquial. Oxford: Clarendon Press. p. 4.
- ^ "Learning International Alphabet of Sanskrit Transliteration". Sanskrit 3 – Learning transliteration. Gabriel Pradiipaka & Andrés Muni. Archived from the original on 12 February 2007. Retrieved 20 November 2006.
- ^ "ITRANS – Indian Language Transliteration Package". Avinash Chopde. Archived from the original on 23 January 2013. Retrieved 20 November 2006.
- ^ "Annex-F: Roman Script Transliteration" (PDF). Indian Standard: Indian Script Code for Information Interchange – ISCII. Bureau of Indian Standards. 1 April 1999. p. 32. Archived (PDF) from the original on 16 July 2012. Retrieved 20 November 2006.
- ^ (Bhattacharya 2000, pp. 16)
- ^ "Bengali". UCLA Language Materials project. University of California, Los Angeles. Archived from the original on 15 July 2007. Retrieved 20 November 2006.
- ^ Boyle David, Anne (2015). Descriptive grammar of Bangla. De Gruyter. pp. 141–142.
- ^ Among Bengali speakers brought up in neighbouring linguistic regions (e.g. Hindi), the lost copula may surface in utterances such as she shikkhôk hocche. This is viewed as ungrammatical by other speakers, and speakers of this variety are sometimes (humorously) referred as "hocche-Bangali".
- ^ Hübschmannová, Milena (1995). "Romaňi čhib – romština: Několik základních informací o romském jazyku". Bulletin Muzea Romské Kultury (4/1995). Brno.
Zatímco romská lexika je bližší hindštině, marvárštině, pandžábštině atd., v gramatické sféře nacházíme mnoho shod s východoindickým jazykem, s bengálštinou.
- ^ Dash, Niladri Sekhar (15 January 2015). A Descriptive Study of Bengali Words. Cambridge University Press. ISBN 978-1-316-22268-3.
- ^ "Bengali". Ethnologue. Archived from the original on 9 March 2023. Retrieved 19 November 2023.
- ^ a b "Bengali language". Archived from the original on 11 October 2016. Retrieved 2 September 2016.
- ^ "Persian in Bengali. : languagehat.com". languagehat.com. Retrieved 24 May 2024.
- ^ Billah, Abu M. M. A. "Lexicographic affinities between Persian and Bengali language". Academia.
- ^ "Persian - Banglapedia". en.banglapedia.org. Retrieved 6 October 2025.
- ^ Byomkes Chakrabarti A Comparative Study of Santali and Bengali, K.P. Bagchi & Co., Kolkata, 1994, ISBN 81-7074-128-9
- ^ Das, Khudiram (1998). Santhali Bangla Samashabda Abhidhan. Kolkata, India: Paschim Banga Bangla Akademi.
- ^ "Bangla santali vasa samporko" (PDF). Archived (PDF) from the original on 1 March 2017. Retrieved 16 March 2017.
- ^ Das, Khudiram. Bangla Santali Bhasa Samporko (eBook).
References
[edit]
|
|
Further reading
[edit]- Thompson, Hanne-Ruth (2012). Bengali. Volume 18 of London Oriental and African Language Library. John Benjamins Publishing. ISBN 90-272-7313-8.
External links
[edit]Bengali language
View on GrokipediaLinguistic Classification
Indo-Aryan Family Placement
Bengali is a New Indo-Aryan language within the Indo-Aryan branch of the Indo-Iranian languages, part of the Indo-European family. It specifically occupies the Eastern Indo-Aryan subgroup, characterized by shared historical developments from Eastern Middle Indo-Aryan vernaculars.[8][9] The language traces its origins to Magadhi Prakrit, a Middle Indo-Aryan form prevalent from roughly 600 BCE to 600 CE in the ancient Magadha kingdom (modern Bihar and adjacent areas), which influenced eastern regions including Bengal. This Prakrit transitioned into Magadhi Apabhramsa around 600–1200 CE, from which Bengali and related varieties emerged as distinct New Indo-Aryan forms by approximately 900–1000 CE.[5] Eastern Indo-Aryan languages, including Bengali, Assamese, Odia, and Maithili, form a genetic cluster supported by lexicostatistical data showing high lexical retention and common innovations separating them from Northern, Western, or Southern Indo-Aryan groups. Divergence within this Eastern cluster is estimated around 1000–1200 CE, reflecting post-Middle Indo-Aryan fragmentation.[9]Comparative Relations
Bengali forms part of the Eastern Indo-Aryan branch, which derives from Magadhi Prakrit, distinguishing it from the Central and Western Indo-Aryan groups exemplified by Hindi and Gujarati through shared innovations in phonology and morphology. The closest linguistic relatives include Assamese and Odia, with which Bengali shares high lexical overlap and partial mutual intelligibility, stemming from common Prakrit ancestry and geographic proximity.[10] Lexical similarity between Bengali and Assamese reaches 80-90%, enabling substantial comprehension, especially in written registers where the scripts align closely until recent divergences.[11] Odia exhibits around 70% similarity, with mutual intelligibility limited by phonological shifts, such as Odia's retention of more conservative vowel qualities absent in Bengali's rounded inherent vowel /ɔ/.[10] In contrast, similarity with Hindi drops to 50-60%, hampered by divergent phonemic inventories—Bengali features post-alveolar fricatives like /ʂ/ and lacks Hindi's breathy-voiced stops in core lexicon—and grammatical gender marking, which Bengali has entirely abandoned.[10] Grammatically, Bengali aligns with other Eastern Indo-Aryan languages in analytic tendencies, relying on postpositions over inflectional cases and eschewing verb agreement for number or gender, unlike the synthetic residues in Hindi where nouns retain masculine-feminine distinctions. Vocabulary comparisons reveal Bengali's core tadbhava terms from Prakrit evolving similarly to Assamese, but with greater Perso-Arabic adstratum (about 10-15% of lexicon) from medieval Islamic administration, contrasting Hindi's heavier Sanskrit tatsama revival post-19th century.[5] Phonological hallmarks include Bengali's merger of Sanskrit sibilants into /ʃ/ and intervocalic stop lenition to /h/ or glides, processes paralleled in Assamese but less pronounced in Odia or Hindi, where retroflex series persist more robustly.[12] These relations underscore Bengali's position as a bridge between conservative Indo-Aryan retention and innovative simplification in the east.Historical Development
Old Bengali Period (Pre-1200 CE)
The Old Bengali period, extending prior to 1200 CE, represents the initial stages of Bengali's divergence as an Eastern Indo-Aryan language from Magadhi Prakrit and Apabhramsha forms prevalent in the Bengal region. Linguistic evolution during this era involved phonological shifts such as the simplification of consonant clusters and the emergence of distinctive vowel patterns, setting Bengali apart from neighboring Western and Southern Indo-Aryan branches. These changes occurred amid the cultural dominance of the Pala dynasty (c. 750–1174 CE), which patronized Mahayana Buddhism and facilitated the transition from Sanskrit-centric literacy to vernacular expressions.[4][13] The earliest surviving literary attestation is the Charyapada, a corpus of 47 Buddhist tantric songs (padyavali) attributed to siddhacaryas like Luipada and Kanhapada, composed between the 8th and 12th centuries. Written in a cryptic Abahatta dialect using sandhya bhasa (esoteric twilight language), these verses blend Sanskrit roots with proto-Bengali morphology, including apocope of final vowels and pleonastic matras, evidencing an early form of the language spoken in eastern India. The manuscript, containing verses 1–47, was rediscovered in 1907 in Nepal's royal library, with paleographic analysis dating the palm-leaf original to the late 11th or early 12th century, though compositions likely span earlier Pala-era monastic centers in Bengal and Bihar.[14][15] Epigraphic records from this period remain predominantly in Sanskrit, employing a proto-Bengali script derived from the Gupta era's eastern variants, characterized by circular letter forms and ligature simplifications that prefigure the modern Bengali-Assamese abugida. The Pala rulers issued copper-plate grants, such as those from Dharmapala (r. 770–810 CE), primarily in Sanskrit but inscribed in this evolving script, hinting at vernacular influence in administrative contexts. Silver coins from the Harikela kingdom (c. 9th–10th centuries) in southeastern Bengal feature legends in proto-Bengali script, marking one of the earliest numismatic uses of the language for royal titles and minting details, reflecting trade and local governance needs.[13][16] This era's linguistic output was constrained by the oral tradition and elite preference for Sanskrit, with proto-Bengali confined largely to Buddhist siddha poetry and marginal inscriptions. The scarcity of texts underscores a transitional phase where the language consolidated its identity through regional Prakrit substrates and Aryan superstrates, laying foundations for later medieval developments under Islamic influences post-1200 CE.[4]Middle Bengali Period (1200–1800 CE)
The Middle Bengali period, spanning approximately 1200 to 1800 CE, commenced following the Turkish conquest of Bengal by Muhammad bin Bakhtiyar Khilji in 1204 CE, which introduced sustained Muslim political dominance and facilitated the integration of Persian and Arabic lexical elements into Bengali.[5] This era witnessed the consolidation of Bengali as a literary medium, with phonological simplifications, morphological innovations, and enriched vocabulary reflecting both indigenous evolution and external contacts, including limited Portuguese and Turkish borrowings alongside dominant Perso-Arabic influxes in administrative, military, and religious domains.[5][8] Phonologically, early Middle Bengali (circa 1300–1500 CE) featured the weakening of half-vowels such as ই্ and উ্, the loss of nasal aspirates, and the replacement of nasalized vowel-consonant sequences with simpler nasal sounds plus consonants.[5] In the later phase (1500–1800 CE), word-final অ underwent elision, epenthesis emerged as a process inserting vowels to ease consonant clusters, and a new vowel sound অ্যা (similar to the 'a' in "hat") developed, contributing to dialectal divergences.[5] Grammatically, verbal forms innovated with inflections like -ইল for past tense and -ইব for future in active voice, while post-positions increasingly marked intransitive passives; nominal endings expanded to include -র for genitive, and plural markers such as -গুলা, -গুলি, and -দি(ে)র proliferated, alongside the rise of phrasal and compound verbs that enhanced expressive flexibility.[5] The script transitioned from Proto-Bengali forms (11th–13th centuries) to a more standardized alphabet by the 14th–15th centuries, fully maturing by the 18th century to support literary proliferation.[8] Literary output flourished, providing primary evidence for linguistic reconstruction, with subperiods delineating shifts: transitional (1200–1300 CE) featuring folk legends like Gopī-canda and Behula-Lakhindar; early Middle (1300–1500 CE) marked by Baru Chandidas's Śrīkṛṣṇa Kīrttana (14th century) and Kṛttivāsa Ojha's Ramayana translation (15th century); and late Middle (1500–1800 CE) dominated by Vaishnava padavali lyrics and Chaitanya biographies under the Bhakti movement's influence.[8][17] Key genres included mangal-kavya narrative poems exalting local deities (e.g., Vijay Gupta's Manasamangal, 1494–95 CE), Sanskrit epic adaptations like Kashiram Das's Mahabharata (1602–1610 CE), and Muslim-authored works such as Shah Muhammad Sagir's Yusuf-Zulekha (circa 1400 CE), blending romantic and ethical themes from Perso-Arabic traditions.[17] Shakta poetry and purbabanga-gitika folk songs further diversified expression, underscoring Bengali's adaptation to syncretic cultural contexts without supplanting core Indo-Aryan structures.[5][17]Modern Bengali Period (1800–Present)
The modern period of Bengali linguistic development, beginning around 1800, coincided with British colonial administration and the Bengal Renaissance, which spurred the creation of prose literature and administrative texts to facilitate governance. Fort William College, established in 1800 in Calcutta, played a pivotal role by commissioning Bengali writers to produce textbooks and grammars, thereby standardizing prose forms and reducing reliance on poetic structures dominant in earlier eras.[18] This effort introduced greater Sanskrit-derived tatsama vocabulary into everyday usage, countering the Perso-Arabic influences from the Middle period, and laid the foundation for a more uniform written standard across dialects.[18] In the 19th century, reformers like Ishwar Chandra Vidyasagar advanced Bengali syntax toward greater analytic simplicity, eliminating complex verb conjugations and case endings in favor of postpositions, making the language more accessible for print media and education.[19] The advent of the printing press in 1818 accelerated this by disseminating novels, essays, and newspapers, which expanded vocabulary through neologisms and translations from English and Sanskrit sources. By the early 20th century, colloquial chalit bhasha began supplanting the formal sadhu bhasha in literature, reflecting spoken norms and enhancing readability, though regional variations persisted between eastern (more tadbhava and Perso-Arabic heavy) and western dialects.[20] The 1952 Bengali Language Movement in East Pakistan marked a linguistic turning point, as protests against Urdu's imposition as the sole state language resulted in Bengali's recognition as an official medium, fostering national identity tied to the tongue spoken by over 50% of Pakistan's population at the time.[7] This event, culminating in deaths on February 21, catalyzed institutional support for Bengali in education and administration, influencing orthographic consistency and literary output; it also inspired global recognition via UNESCO's International Mother Language Day in 1999. Post-1947 partition, Bengali evolved divergently: in West Bengal (India), Sanskritization intensified for cultural revival, while in Bangladesh (formerly East Pakistan), post-1971 independence emphasized decolonization with retained Islamic lexical elements. English exerted substantial lexical influence from the colonial era onward, introducing over 1,000 direct loanwords for technology, governance, and science—such as ṭren (train) and bibi (baby)—often adapted phonetically without native equivalents, comprising up to 5-10% of modern urban vocabulary in code-mixed forms like "Banglish."[21] 20th-century orthographic adjustments addressed inconsistencies in vowel notation and conjunct consonants to accommodate printing and typewriters, promoting phonetic alignment over etymological spelling. Today, digital media and globalization sustain Bengali's vitality, with over 230 million speakers, though urban youth increasingly blend it with English, raising concerns about purism versus pragmatic adaptation.[22]Geographical Distribution
Speaker Demographics
Bengali has approximately 233 million native speakers, ranking it among the top ten most spoken languages by first-language users.[23] An additional 32 million individuals speak it as a second language, bringing the total number of proficient speakers to around 265 million.[1] The overwhelming majority of native speakers reside in the Indian subcontinent, primarily in Bangladesh and the eastern Indian states. In Bangladesh, Bengali serves as the mother tongue for about 98% of the population, which numbered 171.5 million in 2023, yielding roughly 168 million first-language speakers.[24][25] In India, native Bengali speakers total approximately 97 million, concentrated mainly in West Bengal (around 80 million) and Tripura, based on 2011 census figures adjusted for subsequent population growth.[26] Smaller communities exist in Assam, Jharkhand, and other states, as well as in neighboring countries like Nepal.[27] Diaspora populations contribute to global speaker demographics, with significant expatriate communities from Bangladesh and West Bengal maintaining the language. Over 2 million Bangladeshis live in Saudi Arabia alone, alongside substantial groups in the United Arab Emirates (around 94,000) and other Gulf states.[28] In Western countries, notable concentrations include about 300,000 Bangladeshis in the United States and hundreds of thousands in the United Kingdom, many of whom speak Bengali at home.[29] These overseas communities, totaling several million, often preserve Bengali through family use and cultural institutions despite pressures of assimilation.[30]Official and Recognized Status
Bengali serves as the national and sole official language of Bangladesh, as enshrined in Article 3 of the Constitution of the People's Republic of Bangladesh, which designates it as the rāṣṭrabhāṣā (state language).[31] This status was reinforced by the Bengali Language Introduction Act of 1987, which mandates its use in all official correspondence, legislation, and court proceedings throughout the country.[20] In India, Bengali is listed among the 22 scheduled languages in the Eighth Schedule to the Constitution, affording it federal recognition for promotion, preservation, and potential use in Parliament and Union administration.[32] It functions as the primary official language of the states of West Bengal and Tripura, where state governments conduct administration, education, and public services predominantly in Bengali.[33] In Assam's Barak Valley—encompassing the districts of Cachar, Karimganj, and Hailakandi—Bengali shares co-official status with Assamese under provisions of the Assam Official Language Act, accommodating the region's Bengali-majority population.[34] On October 4, 2024, the Government of India granted Bengali classical language status, recognizing its literary heritage spanning over 1,500 years, which entitles it to enhanced funding for research, university chairs, and cultural initiatives.[35] This designation underscores Bengali's historical depth, with ancient texts like the Charyapada (circa 8th–12th centuries CE) evidencing its early development, though it does not confer additional administrative privileges beyond scheduled language protections.[2]Dialects and Varieties
Regional Dialects
Bengali regional dialects are primarily classified into four major clusters—Rāṛhī, Vārendrī, Vāṅgīya, and Kāmrūpī—based on phonological, morphological, and geographical criteria established by linguist Suniti Kumar Chatterji.[36] The Rāṛhī group, prevalent in southwestern West Bengal and adjacent areas, serves as the foundation for standard colloquial Bengali, characterized by retention of aspirated consonants and a relatively conservative vowel system.[36] Vārendrī dialects, spoken in northern regions like Rajshahi and Bogura divisions in Bangladesh and northern West Bengal, feature distinct nasalization patterns and vowel lengthening absent in standard forms.[36] Vāṅgīya varieties occupy central zones, including Nadia and Jessore, with intermediate traits blending western conservatism and eastern innovations such as simplified aspiration.[36] Kāmrūpī dialects, found in northeastern areas like Sylhet, exhibit heavy influence from adjacent Assamese, including retroflex sounds and lexical borrowings, though they remain part of the Bengali continuum.[36] Eastern dialects, broadly encompassing Vāṅgīya and Kāmrūpī extensions in Bangladesh, display greater phonetic shifts, such as debuccalization of retroflexes and enhanced vowel harmony, reflecting substrate influences from pre-Indo-Aryan languages.[37] In contrast, western dialects preserve more proto-Indo-Aryan features, like fuller consonant clusters.[37] Peripheral varieties like Sylheti (northeastern Bangladesh and India) and Chittagonian (southeastern Bangladesh) deviate significantly, with Sylheti showing low mutual intelligibility with standard Bengali due to unique phonemes like implosives and lexical divergence.[38] Chittagonian similarly lacks full intelligibility, featuring tone-like suprasegmentals and Arabic-Persian substrate effects from historical migrations, prompting debates on its status as a distinct language rather than a dialect.[39] Dialects such as Dhaka (central Bangladesh), approximating the literary standard, and Barisal (south-central), with pronounced vowel elongation, maintain higher intelligibility across the continuum.[36] These variations arise from geographic isolation, migration, and contact with Tibeto-Burman or Austroasiatic languages, yet core grammar remains shared.[40]Standardization Processes
The standardization of Bengali emerged in the late 18th century through colonial administrative needs, with Nathaniel Brassey Halhed's A Grammar of the Bengal Language (1778) marking the first use of printed Bengali type and contributing to early grammatical codification.[5] The establishment of Fort William College in 1800 further advanced this by commissioning Bengali grammars, dictionaries, and prose textbooks to train British officers, thereby fostering a uniform prose style distinct from poetic traditions.[41] These efforts shifted Bengali from primarily oral and verse-based forms toward standardized written prose, drawing on the Rarhiya dialect spoken around Kolkata and Nadia as a base.[41] In the 19th century, indigenous scholars built on these foundations amid the Bengali Renaissance, emphasizing grammatical rules and vocabulary norms influenced by Sanskrit derivations, though this Sanskritization drew criticism from some Muslim intellectuals for marginalizing Perso-Arabic elements prevalent in eastern dialects.[42] The Bangiya Sahitya Parishad, founded in 1894, institutionalized these processes by compiling reference dictionaries, translating foreign works into Bengali, and promoting literary uniformity across diverse regional varieties.[43] Its activities reinforced a Kolkata-centric standard, prioritizing tatsama (Sanskrit-derived) vocabulary in formal registers.[44] Orthographic standardization accelerated in the 20th century, with the University of Calcutta implementing spelling reforms in 1936 to resolve inconsistencies in script rendering, such as vowel notations and conjunct forms, which had varied due to scribal traditions and printing limitations.[45] These reforms aimed at phonetic consistency while preserving etymological ties to Sanskrit and Prakrit roots, though implementation remained uneven across dialects.[41] Post-1947 partition introduced divergences: West Bengal continued Sanskrit-leaning norms under institutions like the Bangiya Sahitya Parishad, while Bangladesh established the Bangla Academy in 1955 to adapt standards for eastern varieties, favoring tadbhava (native-evolved) and Perso-Arabic terms in official usage to reflect demographic realities.[46] Despite these preferences, the shared literary canon—rooted in 19th-century prose—ensures mutual intelligibility in standard forms, with differences primarily lexical rather than structural.[47] Ongoing efforts in both regions address digital encoding and dialectal inclusion, but no unified pan-Bengali authority exists, perpetuating subtle formal variations.[45]Phonology
Consonant and Vowel Inventory
The Bengali consonant inventory consists of 28 phonemes according to analyses from the Central Institute of Indian Languages, encompassing stops, affricates, fricatives, nasals, approximants, and rhotics across multiple places of articulation.[48] These include voiceless and voiced stops—aspirated and unaspirated—at bilabial, dental, retroflex, palatal (affricates), and velar places, yielding 20 such obstruents; additional nasals (/m, n, ŋ/), fricatives (/ʃ, h/), lateral (/l/), rhotic (/r/), and flap (/ɼ/).[48] Some inventories count 29 consonants by distinguishing palatal nasal /ɲ/ separately from /n/ allophones, though /n/ assimilates to [ɲ] before palatals.[49] Glottal /h/ often realizes as breathy [ɦ] intervocalically, and /ʃ/ exhibits allophones including [s̪] and [ʂ].[48]| Place/Manner | Bilabial | Dental | Retroflex | Palatal/Affricate | Velar | Glottal |
|---|---|---|---|---|---|---|
| Stops (voiceless unaspirated) | /p/ | /t/ | /ʈ/ | /t͡ʃ/ | /k/ | |
| Stops (voiced unaspirated) | /b/ | /d/ | /ɖ/ | /d͡ʒ/ | /g/ | |
| Stops (voiceless aspirated) | /pʰ/ | /tʰ/ | /ʈʰ/ | /t͡ʃʰ/ | /kʰ/ | |
| Stops (voiced aspirated) | /bʰ/ | /dʰ/ | /ɖʰ/ | /d͡ʒʰ/ | /gʰ/ | |
| Nasals | /m/ | /n/ | /ŋ/ | |||
| Fricatives | /ʃ/ | /h/ | ||||
| Approximants/Rhotics | ||||||
| Lateral/Flap | /l/ | /ɼ/ | /r/ (trill/flap) |
Suprasegmental Features
Bengali features a predictable, non-contrastive stress pattern, with primary stress typically assigned to the initial syllable of polysyllabic words, particularly content words, contributing to rhythmic emphasis rather than lexical distinction. This fixed initial stress aligns with the language's syllable structure and prosodic organization, where secondary stresses may occur on alternating syllables in longer words, but deviations are rare and non-phonemic. Unlike languages with lexical stress (e.g., English), Bengali stress does not alter word meaning and is often overshadowed by intonational contours for prominence.[50][51] Intonation in Bengali operates within an autosegmental-metrical framework, characterized by pitch accents (e.g., high H* or low L* on stressed syllables) and boundary tones that signal phrase-level distinctions such as statements, questions, and focus. Declarative sentences typically end with a low boundary tone (L%), while yes-no questions feature a rising high tone (H%) at the phrase boundary, and wh-questions may employ a bitonal pitch accent for emphasis. The basic prosodic unit is the accentual phrase, often comprising two content words with an underlying high-low pitch sequence (H L), which delimits grouping and rhythm. This system, documented through perceptual and acoustic analyses, lacks lexical tones but uses fundamental frequency (F0) variations for pragmatic and syntactic cues.[50] Bengali exhibits a syllable-timed rhythm, where syllables occur at relatively equal intervals, contrasting with stress-timed languages and influencing speech rate and durational patterns in connected speech. Nasalization, while primarily segmental (affecting vowels phonemically), can extend suprasegmentally in prosodic contexts like vowel harmony or emphatic lengthening, though it does not function as a tone or stress marker. These features collectively support efficient information encoding in rapid speech, as observed in acoustic studies of native production.[52][53]Orthography
Bengali-Assamese Script
The Bengali-Assamese script, also known as Eastern Nagari, is an abugida derived from the eastern variant of the Brahmi script, specifically Kutilalipi, which developed a distinctive form around the 7th century CE.[4] This script evolved through intermediate stages from Magadhi Prakrit via Magadhi Apabhramsa and Avahattha, serving as the primary writing system for Bengali and Assamese languages in eastern India and Bangladesh.[4] It is written from left to right, with consonants carrying an inherent vowel sound /ô/, which can be modified or suppressed using diacritic marks called matras for other vowels.[4] As an abugida, the script forms syllables by combining consonant letters with optional vowel signs; compound consonants are represented through conjunct ligatures, often employing half-forms of letters stacked horizontally or vertically to indicate clustering without intervening vowels.[4] This system requires a large character set, historically 448 to 536 glyphs in metal type foundries to accommodate variations and conjuncts.[4] The script lacks case distinction and uses virama (halant) to suppress the inherent vowel in consonants.[54] While Bengali and Assamese share the same script with high glyph similarity, minor differences exist, such as Assamese employing a distinct form for the letter wa (ৱ, wô) derived from an older variant of ra, and variations in the ra phoneme (র in Bengali versus historical forms in Assamese manuscripts).[55] These orthographic distinctions reflect phonetic divergences, with Assamese retaining sounds like /w/ more prominently.[56] In Unicode, the script is encoded in the Bengali block (U+0980–U+09FF), which accommodates both languages without separate blocks, treating Assamese as a variant; the block supports additional characters for Assamese-specific usages.[54] Historical reforms facilitated printing and standardization: Nathaniel Brassey Halhed's 1778 grammar introduced early printed Bengali, followed by Charles Wilkins' movable type in 1800; Ishwar Chandra Vidyasagar's 1840s rearrangement optimized letter order for compositors.[4] Spelling standardization advanced in 1936 through initiatives by the University of Calcutta, reducing inconsistencies in orthography.[45] Later typographic developments included Linotype adoption in 1935 and modern digital fonts like those from the Institute of Typographical Research.[4]Historical and Variant Scripts
The Bengali script's historical precursors trace back through the eastern Brahmi lineage, with significant evolution during the Gupta period (4th–6th centuries CE), where cursive forms emerged in copper plate inscriptions from regions like North Bengal and Comilla.[3] By the 7th century CE, the Kutila script developed, characterized by bent arms and triangular elements, as observed in the Nidhanpur copper plate.[3] During the Pala period (8th–12th centuries CE), Proto-Nagari forms transitioned into Proto-Bangla script in the 9th–10th centuries CE, visible in inscriptions such as those from Khalimpur and Bangarh.[3] This Proto-Bangla, also referred to as Gaudi script in some scholarly accounts, served as the direct antecedent to the modern Bengali-Assamese script and appeared on silver coins of the Harikela Kingdom circa 9th–13th centuries CE.[3] [8] The fully developed modern Bangla script materialized by the 11th–12th centuries CE, as evidenced by the Anulia copper plate around 1196 CE and the Sundarban plate.[3] A prominent variant script, Sylheti Nagri, originated in the early 14th century CE, drawing from Bengali, Kaithi, Devanagari, and Arabic influences, and was primarily utilized by Muslim writers in Sylhet and nearby areas including Kishoreganj, Mymensingh, Netrakona, Kachhar, and Karimganj for religious puthis and literary works often incorporating Arabic and Persian terms.[57] The script's earliest dated manuscript is Talib Huson from 1549 CE, with around 150 extant texts by approximately 60 authors; additional specimens appear on Afghan coins from the late 16th to early 17th centuries CE.[57] Printing efforts, such as those by Maulvi Abdul Karim in Sylhet around 1860–1870 CE, briefly promoted its use through primers like Sylheti Nagrir Pahela Ketab, but it largely fell into disuse with the standardization and dominance of the Bengali-Assamese script.[57]Romanization and Digital Standards
Various systems for romanizing Bengali into the Latin alphabet have been developed, primarily for scholarly, bibliographic, and international purposes. The ALA-LC romanization table, maintained by the Library of Congress, provides a scheme for transliterating Bengali characters, supplying the implicit vowel a after consonants unless otherwise indicated and using diacritics for distinctions like long vowels (e.g., ā for অ).[58] ISO 15919, an international standard published in 2001, extends this approach to Indic scripts including Bengali, employing macrons for long vowels (e.g., ā, ī), underdots for retroflex sounds (e.g., ṭ, ḍ), and hooks for aspirates (e.g., kh, gh), to ensure precise phonetic representation without ambiguity.[59][60] These systems prioritize consistency over phonetic transcription, differing from informal practices common in digital media where ad-hoc spellings (e.g., "bangla" for বাংলা) prevail, often reflecting spoken dialects rather than orthographic fidelity.[61] The United Nations Group of Experts on Geographical Names endorses a romanization for Bengali place names aligned with ISO principles, facilitating global standardization while accommodating script-specific features like matras (vowel signs).[62] In practice, romanization aids non-native access to Bengali texts but faces challenges from the script's abugida nature, where consonants inherently include schwa (ə), requiring decisions on elision (e.g., কলকাতা as Kalikata rather than Kolkāta in some transcriptions). Scholarly works often favor ISO 15919 for its extensibility to related scripts like Assamese.[63] Digitally, Bengali adheres to Unicode standards, with the script encoded in the Bengali block (U+0980–U+09FF), initially derived from the ISCII-1988 layout and incorporated since Unicode 1.1 in 1993.[64] This block supports 128 characters, including independent vowels, consonants, matras, and conjunct forms essential for rendering complex ligatures, such as য + ় + য় for y+y clusters.[54] Normalization forms (NFC/NFD) address decomposition of precomposed characters, while rendering engines must handle reordering of virama-suppressed conjuncts and zwnj/zwj for glyph selection, as outlined in Unicode's core specification.[65] Collation and sorting follow Unicode Collation Algorithm tailoring for Bengali, prioritizing matra positions and ignoring certain modifiers for linguistic accuracy, with W3C guidelines emphasizing bidirectional text support and line-breaking rules around punctuation like danda (।).[66] Input standards include phonetic methods (e.g., mapping QWERTY to Bengali via Avro) and fixed layouts (e.g., Bijoy's legacy 8-bit encoding migrated to Unicode), promoting interoperability across platforms despite early encoding mismatches from proprietary systems.[67] These digital frameworks enable widespread online use, though legacy non-Unicode fonts persist in regions with limited adoption.Grammar
Syntactic Structure
Bengali syntax is characterized by a predominantly Subject-Object-Verb (SOV) word order in main clauses, reflecting its head-final typology common among Indo-Aryan languages.[68][69] This structure positions the verb at the end, with subjects and objects marked morphologically to permit some scrambling for emphasis or topicalization without loss of grammaticality.[70] Postpositions, rather than prepositions, govern oblique cases, attaching to the right of noun phrases (e.g., bari-te 'in the house', where -te marks locative case).[70][71] Noun phrases follow a head-final order, with attributive adjectives, numerals, and possessors preceding the head noun, often without articles but with optional classifiers or demonstratives (e.g., ek-tA boro ghar 'one big house').[72] Case marking on nouns and pronouns is realized through enclitic postpositions or suffixes, including nominative (unmarked), accusative-genitive (-ke or -r), locative (-te), and ablative (-theke), enabling the language's nominative-accusative alignment while supporting flexible constituent order.[71][73] Verb phrases consist of a root combined with tense-aspect markers followed by person-honorific endings, with agreement primarily in person (first, second, third) and honorific levels rather than number or gender, except in limited contexts like second-person past forms.[69][74] Subordinate clauses typically precede main clauses, introduced by complementizers like je 'that' or jodi 'if', maintaining the overall head-final pattern.[75] Negation in finite clauses is achieved by appending the particle nā to the inflected verb (e.g., karlam nā 'did not do'), while perfective forms may use ni instead; this sentential negation does not trigger widespread changes in word order.[76][77] Yes-no questions retain SOV order but add the interrogative particle ki clause-initially or rely on intonation, whereas wh-questions place interrogatives (e.g., kāke 'whom') in situ or fronted for focus, with verb-final positioning preserved.[78] These features contribute to Bengali's analytic tendencies, where context and morphology compensate for rigid order constraints.[79]Nominal and Verbal Morphology
Bengali nouns lack grammatical gender and exhibit limited inflectional morphology, primarily marking number through optional suffixes or zero marking for certain collectives and mass nouns. The plural is typically formed by suffixes such as -gulo for countable nouns referring to small sets or -era for larger groups, though many plurals remain unmarked in context due to the language's analytic nature.[80] [81] Definiteness is indicated by the suffix -টা (for singular masculines or inanimates) or its variants -টি and -টি, which attach directly to the noun stem.[80] Case relations are not expressed through extensive declensional endings but via postpositions that govern either the nominative (unmarked) or genitive form of the noun. The genitive case, marked by -er (after vowels) or -r (after consonants), serves as the base for most oblique cases, including locative (-te), ablative (-theke), instrumental (- diye), and comitative (-songe). Nominative case remains unmarked for subjects and direct objects in simple transitive clauses. Adjectives and numerals precede the noun without agreement in case, number, or gender, remaining invariant.[80] [82] Pronouns follow similar patterns but distinguish three persons and honorific levels, with third-person pronouns showing natural gender distinctions (e.g., se for human masculine/feminine, ta for non-human).[80] Verbal morphology in Bengali combines tense, aspect, and person through agglutinative suffixes on the verb root, with finite verbs conjugating for three persons (first, second, third) and honorific distinctions via auxiliary selection or endings. Verbs divide into classes based on stem ending (vowel-final or consonant-final), affecting conjugation patterns; for example, vowel-final roots like khawa "to eat" add person markers directly, while consonant-final roots like kar "to do" from kora may nasalize or insert vowels. Three main tenses are distinguished: present (unmarked or with person endings), past (marked by -l(o) or -i(l)), and future (marked by -b(o)).[69] [80] Aspectual distinctions include simple (no marker), continuous (infix -ch-), and perfect (suffix -e or -chhe for ongoing perfective). These combine with tenses to form nine primary tense-aspect forms, such as present continuous (-ch(e)) or past perfect (-e chhil(o)). Person agreement appears prominently in present and future tenses (e.g., first person singular -i, second informal -i, third human -e), but past tense often defaults to neutral forms with contextual person inference. Moods include indicative (default), imperative (bare stem for informal second person, or -o for polite), and conditional (past stem + -e). Auxiliaries like howa "to be" compound for passive or progressive constructions, as in khaw-a hoy "is eaten."[69] [80] [83]| Tense-Aspect | 1st Sg. Example (kari "I do/eat") | 3rd Sg. Human Example |
|---|---|---|
| Present Simple | kari | kare |
| Present Continuous | kar-ch(i) | kar-ch(e) |
| Past Simple | khar-lam | khar-lo |
| Future Simple | kar-bo | kar-be |