Recent from talks
Nothing was collected or created yet.
Arabic
View on Wikipedia
Arabic[c] is a Central Semitic language of the Afroasiatic language family spoken primarily in the Arab world.[13] The International Organization for Standardization (ISO) assigns language codes to 32 varieties of Arabic, including its standard form of Literary Arabic, known as Modern Standard Arabic,[14] which is derived from Classical Arabic. This distinction exists primarily among Western linguists; Arabic speakers themselves generally do not distinguish between Modern Standard Arabic and Classical Arabic, but rather refer to both as al-ʿarabiyyatu l-fuṣḥā (اَلعَرَبِيَّةُ ٱلْفُصْحَىٰ[15] "the eloquent Arabic") or simply al-fuṣḥā (اَلْفُصْحَىٰ).
Arabic is the third most widespread official language after English and French,[16] one of six official languages of the United Nations,[17] and the liturgical language of Islam.[18] Arabic is widely taught in schools and universities around the world and is used to varying degrees in workplaces, governments and the media.[18] During the Middle Ages, Arabic was a major vehicle of culture and learning, especially in science, mathematics and philosophy. As a result, many European languages have borrowed words from it. Arabic influence, mainly in vocabulary, is seen in European languages (mainly Spanish and to a lesser extent Portuguese, Catalan, and Sicilian) owing to the proximity of Europe and the long-lasting Arabic cultural and linguistic presence, mainly in Southern Iberia, during the Al-Andalus era. Maltese is a Semitic language developed from a dialect of Arabic and written in the Latin alphabet.[19] The Balkan languages, including Albanian, Greek, Serbo-Croatian, and Bulgarian, have also acquired many words of Arabic origin, mainly through direct contact with Ottoman Turkish.
Arabic has influenced languages across the globe throughout its history, especially languages where Islam is the predominant religion and in countries that were conquered by Muslims. The most markedly influenced languages are Persian, Turkish, Hindustani (Hindi and Urdu),[20] Kashmiri, Kurdish, Bosnian, Kazakh, Bengali, Malay (Indonesian and Malaysian), Maldivian, Pashto, Punjabi, Albanian, Armenian, Azerbaijani, Sicilian, Spanish, Greek, Bulgarian, Tagalog, Sindhi, Odia,[21] Hebrew and African languages such as Hausa, Amharic, Tigrinya, Somali, Tamazight, and Swahili. Conversely, Arabic has borrowed some words (mostly nouns) from other languages, including its sister-language Aramaic, Persian, Greek, and Latin and to a lesser extent and more recently from Turkish, English, French, and Italian.
Arabic is spoken by as many as 380 million speakers, both native and non-native, in the Arab world,[1] making it the fifth most spoken language in the world[22] and the fourth most used language on the Internet in terms of users.[23][24] It also serves as the liturgical language of more than 2 billion Muslims.[17] In 2011, Bloomberg Businessweek ranked Arabic the fourth most useful language for business, after English, Mandarin Chinese, and French.[25] Arabic is written with the Arabic alphabet, an abjad script that is written from right to left.
Classical Arabic (and Modern Standard Arabic) is considered a conservative language among Semitic languages, it preserved the complete Proto-Semitic three grammatical cases and declension (ʾiʿrāb), and it was used in the reconstruction of Proto-Semitic since it preserves as contrastive 28 out of the evident 29 consonantal phonemes.[26]
Classification
[edit]Arabic is usually classified as a Central Semitic language. Linguists still differ as to the best classification of Semitic language sub-groups.[27] The Semitic languages changed between Proto-Semitic and the emergence of Central Semitic languages, particularly in grammar. Innovations of the Central Semitic languages—all maintained in Arabic—include:
- The conversion of the suffix-conjugated stative formation (jalas-) into a past tense.
- The conversion of the prefix-conjugated preterite-tense formation (yajlis-) into a present tense.
- The elimination of other prefix-conjugated mood/aspect forms (e.g., a present tense formed by doubling the middle root, a perfect formed by infixing a /t/ after the first root consonant, probably a jussive formed by a stress shift) in favor of new moods formed by endings attached to the prefix-conjugation forms (e.g., -u for indicative, -a for subjunctive, no ending for jussive, -an or -anna for energetic).
- The development of an internal passive.
There are several features which Classical Arabic, the modern Arabic varieties, as well as the Safaitic and Hismaic inscriptions share which are unattested in any other Central Semitic language variety, including the Dadanitic and Taymanitic languages of the northern Hejaz. These features are evidence of common descent from a hypothetical ancestor, Proto-Arabic.[28][29] The following features of Proto-Arabic can be reconstructed with confidence:[30]
- negative particles m * /mā/; lʾn */lā-ʾan/ to Classical Arabic lan
- mafʿūl G-passive participle
- prepositions and adverbs f, ʿn, ʿnd, ḥt, ʿkdy
- a subjunctive in -a
- t-demonstratives
- leveling of the -at allomorph of the feminine ending
- ʾn complementizer and subordinator
- the use of f- to introduce modal clauses
- independent object pronoun in (ʾ)y
- vestiges of nunation
On the other hand, several Arabic varieties are closer to other Semitic languages and maintain features not found in Classical Arabic, indicating that these varieties cannot have developed from Classical Arabic.[31][32] Thus, Arabic vernaculars do not descend from Classical Arabic:[33] Classical Arabic is a sister language rather than their direct ancestor.[28]
History
[edit]Old Arabic
[edit]Arabia had a wide variety of Semitic languages in antiquity. The term "Arab" was initially used to describe those living in Mesopotamia, Levant, Sinai, and the Arabian Peninsula, as perceived by geographers from ancient Greece.[13][34] In the southwest, various Central Semitic languages both belonging to and outside the Ancient South Arabian family (e.g. Southern Thamudic) were spoken. It is believed that the ancestors of the Modern South Arabian languages (non-Central Semitic languages) were spoken in southern Arabia at this time. To the north, in the oases of northern Hejaz, Dadanitic and Taymanitic held some prestige as inscriptional languages. In Najd and parts of western Arabia, a language known to scholars as Thamudic C is attested.[13]
In eastern Arabia, inscriptions in a script derived from ASA attest to a language known as Hasaitic. On the northwestern frontier of Arabia, various languages known to scholars as Thamudic B, Thamudic D, Safaitic, and Hismaic are attested. The last two share important isoglosses with later forms of Arabic, leading scholars to theorize that Safaitic and Hismaic are early forms of Arabic and that they should be considered Old Arabic.[13]
Linguists generally believe that "Old Arabic", a collection of related dialects that constitute the precursor of Arabic, first emerged during the Iron Age.[27] Previously, the earliest attestation of Old Arabic was thought to be a single 1st century CE inscription in Sabaic script at Qaryat al-Faw, in southern present-day Saudi Arabia. However, this inscription does not participate in several of the key innovations of the Arabic language group, such as the conversion of Semitic mimation to nunation in the singular. It is best reassessed as a separate language on the Central Semitic dialect continuum.[35]
It was also thought that Old Arabic coexisted alongside—and then gradually displaced—epigraphic Ancient North Arabian (ANA), which was theorized to have been the regional tongue for many centuries. ANA, despite its name, was considered a very distinct language, and mutually unintelligible, from "Arabic". Scholars named its variant dialects after the towns where the inscriptions were discovered (Dadanitic, Taymanitic, Hismaic, Safaitic).[27] However, most arguments for a single ANA language or language family were based on the shape of the definite article, a prefixed h-. It has been argued that the h- is an archaism and not a shared innovation, and thus unsuitable for language classification, rendering the hypothesis of an ANA language family untenable.[36] Safaitic and Hismaic, previously considered ANA, should be considered Old Arabic due to the fact that they participate in the innovations common to all forms of Arabic.[13]
The earliest attestation of continuous Arabic text in an ancestor of the modern Arabic script are three lines of poetry by a man named Garm(')allāhe found in En Avdat, Israel, and dated to around 125 CE.[37] This is followed by the Namara inscription, an epitaph of the Lakhmid king Imru' al-Qays bar 'Amro, dating to 328 CE, found at Namaraa, Syria. From the 4th to the 6th centuries, the Nabataean script evolved into the Arabic script recognizable from the early Islamic era.[38] There are inscriptions in an undotted, 17-letter Arabic script dating to the 6th century CE, found at four locations in Syria (Zabad, Jebel Usays, Harran, Umm el-Jimal). The oldest surviving papyrus in Arabic dates to 643 CE, and it uses dots to produce the modern 28-letter Arabic alphabet. The language of that papyrus and of the Qur'an is referred to by linguists as "Quranic Arabic", as distinct from its codification soon thereafter into "Classical Arabic".[27]
Classical Arabic
[edit]In late pre-Islamic times, a transdialectal and transcommunal variety of Arabic emerged in the Hejaz, which continued living its parallel life after literary Arabic had been institutionally standardized in the 2nd and 3rd century of the Hijra, most strongly in Judeo-Christian texts, keeping alive ancient features eliminated from the "learned" tradition (Classical Arabic).[39] This variety and both its classicizing and "lay" iterations have been termed Middle Arabic in the past, but they are thought to continue an Old Higazi register. It is clear that the orthography of the Quran was not developed for the standardized form of Classical Arabic; rather, it shows the attempt on the part of writers to record an archaic form of Old Higazi.[citation needed]
In the late 6th century AD, a relatively uniform intertribal "poetic koine" distinct from the spoken vernaculars developed based on the Bedouin dialects of Najd, probably in connection with the court of al-Ḥīra. During the first Islamic century, the majority of Arabic poets and Arabic-writing persons spoke Arabic as their mother tongue. Their texts, although mainly preserved in far later manuscripts, contain traces of non-standardized Classical Arabic elements in morphology and syntax.[citation needed]
Standardization
[edit]Abu al-Aswad al-Du'ali (c. 603–689) is credited with standardizing Arabic grammar, or an-naḥw (النَّحو "the way"[40]), and pioneering a system of diacritics to differentiate consonants (نقط الإعجام nuqaṭu‿l-i'jām "pointing for non-Arabs") and indicate vocalization (التشكيل at-tashkīl).[41] Al-Khalil ibn Ahmad al-Farahidi (718–786) compiled the first Arabic dictionary, Kitāb al-'Ayn (كتاب العين "The Book of the Letter ع"), and is credited with establishing the rules of Arabic prosody.[42] Al-Jahiz (776–868) proposed to Al-Akhfash al-Akbar an overhaul of the grammar of Arabic, but it would not come to pass for two centuries.[43] The standardization of Arabic reached completion around the end of the 8th century. The first comprehensive description of the ʿarabiyya "Arabic", Sībawayhi's al-Kitāb, is based first of all upon a corpus of poetic texts, in addition to Qur'an usage and Bedouin informants whom he considered to be reliable speakers of the ʿarabiyya.[44]
Spread
[edit]Arabic spread with the spread of Islam. Following the early Muslim conquests, Arabic gained vocabulary from Middle Persian and Turkish.[45] In the early Abbasid period, many Classical Greek terms entered Arabic through translations carried out at Baghdad's House of Wisdom.[45]
By the 8th century, knowledge of Classical Arabic had become an essential prerequisite for rising into the higher classes throughout the Islamic world, both for Muslims and non-Muslims. For example, Maimonides, the Andalusi Jewish philosopher, authored works in Judeo-Arabic—Arabic written in Hebrew script.[46]
Development
[edit]Ibn Jinni of Mosul, a pioneer in phonology, wrote prolifically in the 10th century on Arabic morphology and phonology in works such as Kitāb Al-Munṣif, Kitāb Al-Muḥtasab, and Kitāb Al-Khaṣāʾiṣ .[47]
Ibn Mada' of Cordoba (1116–1196) realized the overhaul of Arabic grammar first proposed by Al-Jahiz 200 years prior.[43]
The Maghrebi lexicographer Ibn Manzur compiled Lisān al-ʿArab (لسان العرب, "Tongue of Arabs"), a major reference dictionary of Arabic, in 1290.[48]
Neo-Arabic
[edit]Charles Ferguson's koine theory claims that the modern Arabic dialects collectively descend from a single military koine that sprang up during the Islamic conquests; this view has been challenged in recent times. Ahmad al-Jallad proposes that there were at least two considerably distinct types of Arabic on the eve of the conquests: Northern and Central (Al-Jallad 2009). The modern dialects emerged from a new contact situation produced following the conquests. Instead of the emergence of a single or multiple koines, the dialects contain several sedimentary layers of borrowed and areal features, which they absorbed at different points in their linguistic histories.[44] According to Veersteegh and Bickerton, colloquial Arabic dialects arose from pidginized Arabic formed from contact between Arabs and conquered peoples. Pidginization and subsequent creolization among Arabs and arabized peoples could explain relative morphological and phonological simplicity of vernacular Arabic compared to Classical and MSA.[49][50]
In around the 11th and 12th centuries in al-Andalus, the zajal and muwashah poetry forms developed in the dialectical Arabic of Cordoba and the Maghreb.[51]
Nahda
[edit]The Nahda was a cultural and especially literary renaissance of the 19th century in which writers sought "to fuse Arabic and European forms of expression."[52] According to James L. Gelvin, "Nahda writers attempted to simplify the Arabic language and script so that it might be accessible to a wider audience."[52]
In the wake of the industrial revolution and European hegemony and colonialism, pioneering Arabic presses, such as the Amiri Press established by Muhammad Ali (1819), dramatically changed the diffusion and consumption of Arabic literature and publications.[53] Rifa'a al-Tahtawi proposed the establishment of Madrasat al-Alsun in 1836 and led a translation campaign that highlighted the need for a lexical injection in Arabic, to suit concepts of the industrial and post-industrial age (such as sayyārah سَيَّارَة 'automobile' or bākhirah باخِرة 'steamship').[54][55]
In response, a number of Arabic academies modeled after the Académie française were established with the aim of developing standardized additions to the Arabic lexicon to suit these transformations,[56] first in Damascus (1919), then in Cairo (1932), Baghdad (1948), Rabat (1960), Amman (1977), Khartum (1993), and Tunis (1993).[57] They review language development, monitor new words and approve the inclusion of new words into their published standard dictionaries.
In 1997, a bureau of Arabization standardization was added to the Educational, Cultural, and Scientific Organization of the Arab League.[57] These academies and organizations have worked toward the Arabization of the sciences, creating terms in Arabic to describe new concepts, toward the standardization of these new terms throughout the Arabic-speaking world, and toward the development of Arabic as a world language.[57] This gave rise to what Western scholars call Modern Standard Arabic. From the 1950s, Arabization became a postcolonial nationalist policy in countries such as Tunisia, Algeria, Morocco,[58] and Sudan.[59]
Classical, Modern Standard and spoken Arabic
[edit]Arabic usually refers to Standard Arabic, which Western linguists divide into Classical Arabic and Modern Standard Arabic.[60] It could also refer to any of a variety of regional vernacular Arabic dialects, which are not necessarily mutually intelligible.

Classical Arabic is the language found in the Quran, used from the period of Pre-Islamic Arabia to that of the Abbasid Caliphate. Classical Arabic is prescriptive, according to the syntactic and grammatical norms laid down by classical grammarians (such as Sibawayh) and the vocabulary defined in classical dictionaries (such as the Lisān al-ʻArab).[citation needed]
Modern Standard Arabic (MSA) largely follows the grammatical standards of Classical Arabic and uses much of the same vocabulary. However, it has discarded some grammatical constructions and vocabulary that no longer have any counterpart in the spoken varieties and has adopted certain new constructions and vocabulary from the spoken varieties. Much of the new vocabulary is used to denote concepts that have arisen in the industrial and post-industrial era, especially in modern times.[61]
Due to its grounding in Classical Arabic, Modern Standard Arabic is removed over a millennium from everyday speech, which is construed as a multitude of dialects of this language. These dialects and Modern Standard Arabic are described by some scholars as not mutually comprehensible. The former are usually acquired in families, while the latter is taught in formal education settings. However, there have been studies reporting some degree of comprehension of stories told in the standard variety among preschool-aged children.[61]
The relation between Modern Standard Arabic and these dialects is sometimes compared to that of Classical Latin and Vulgar Latin vernaculars (which became Romance languages) in medieval and early modern Europe.[60]
MSA is the variety used in most current, printed Arabic publications, spoken by some of the Arabic media across North Africa and the Middle East, and understood by most educated Arabic speakers. "Literary Arabic" and "Standard Arabic" (فُصْحَى fuṣḥá) are less strictly defined terms that may refer to Modern Standard Arabic or Classical Arabic.[citation needed]
Some of the differences between Classical Arabic (CA) and Modern Standard Arabic (MSA) are as follows:[citation needed]
- Certain grammatical constructions of CA that have no counterpart in any modern vernacular dialect (e.g., the energetic mood) are almost never used in Modern Standard Arabic.[citation needed]
- Case distinctions are very rare in Arabic vernaculars. As a result, MSA is generally composed without case distinctions in mind, and the proper cases are added after the fact, when necessary. Because most case endings are noted using final short vowels, which are normally left unwritten in the Arabic script, it is unnecessary to determine the proper case of most words. The practical result of this is that MSA, like English and Standard Chinese, is written in a strongly determined word order and alternative orders that were used in CA for emphasis are rare. In addition, because of the lack of case marking in the spoken varieties, most speakers cannot consistently use the correct endings in extemporaneous speech. As a result, spoken MSA tends to drop or regularize the endings except when reading from a prepared text.[citation needed]
- The numeral system in CA is complex and heavily tied in with the case system. This system is never used in MSA, even in the most formal of circumstances; instead, a greatly simplified system is used, approximating the system of the conservative spoken varieties.[citation needed]
MSA uses much Classical vocabulary (e.g., dhahaba 'to go') that is not present in the spoken varieties, but deletes Classical words that sound obsolete in MSA. In addition, MSA has borrowed or coined many terms for concepts that did not exist in Quranic times, and MSA continues to evolve.[62] Some words have been borrowed from other languages—notice that transliteration mainly indicates spelling and not real pronunciation (e.g., فِلْم film 'film' or ديمقراطية dīmuqrāṭiyyah 'democracy').[citation needed]
The current preference is to avoid direct borrowings, preferring to either use loan translations (e.g., فرع farʻ 'branch', also used for the branch of a company or organization; جناح janāḥ 'wing', is also used for the wing of an airplane, building, air force, etc.), or to coin new words using forms within existing roots (استماتة istimātah 'apoptosis', using the root موت m/w/t 'death' put into the Xth form, or جامعة jāmiʻah 'university', based on جمع jamaʻa 'to gather, unite'; جمهورية jumhūriyyah 'republic', based on جمهور jumhūr 'multitude'). An earlier tendency was to redefine an older word although this has fallen into disuse (e.g., هاتف hātif 'telephone' < 'invisible caller (in Sufism)'; جريدة jarīdah 'newspaper' < 'palm-leaf stalk').[citation needed]
Colloquial or dialectal Arabic refers to the many national or regional varieties which constitute the everyday spoken language. Colloquial Arabic has many regional variants; geographically distant varieties usually differ enough to be mutually unintelligible, and some linguists consider them distinct languages.[63] However, research indicates a high degree of mutual intelligibility between closely related Arabic variants for native speakers listening to words, sentences, and texts; and between more distantly related dialects in interactional situations.[64]

The varieties are typically unwritten. They are often used in informal spoken media, such as soap operas and talk shows,[66] as well as occasionally in certain forms of written media such as poetry and printed advertising.
Hassaniya Arabic, Maltese, and Cypriot Arabic are only varieties of modern Arabic to have acquired official recognition.[67] Hassaniya is official in Mali[68] and recognized as a minority language in Morocco,[69] while the Senegalese government adopted the Latin script to write it.[11] Maltese is official in (predominantly Catholic) Malta and written with the Latin script. Linguists agree that it is a variety of spoken Arabic, descended from Siculo-Arabic, though it has experienced extensive changes as a result of sustained and intensive contact with Italo-Romance varieties, and more recently also with English. Due to "a mix of social, cultural, historical, political, and indeed linguistic factors", many Maltese people today consider their language Semitic but not a type of Arabic.[70] Cypriot Arabic is recognized as a minority language in Cyprus.[71]
Status and usage
[edit]Diglossia
[edit]The sociolinguistic situation of Arabic in modern times provides a prime example of the linguistic phenomenon of diglossia, which is the normal use of two separate varieties of the same language, usually in different social situations. Tawleed is the process of giving a new shade of meaning to an old classical word. For example, al-hatif lexicographically means the one whose sound is heard but whose person remains unseen. Now the term al-hatif is used for a telephone. Therefore, the process of tawleed can express the needs of modern civilization in a manner that would appear to be originally Arabic.[72]
In the case of Arabic, educated Arabs of any nationality can be assumed to speak both their school-taught Standard Arabic as well as their native dialects, which depending on the region may be mutually unintelligible.[73][74][75][76][77] Some of these dialects can be considered to constitute separate languages which may have "sub-dialects" of their own.[78] When educated Arabs of different dialects engage in conversation (for example, a Moroccan speaking with a Lebanese), many speakers code-switch back and forth between the dialectal and standard varieties of the language, sometimes even within the same sentence.

The issue of whether Arabic is one language or many languages is politically charged, in the same way it is for the varieties of Chinese, Hindi and Urdu, Serbian and Croatian, Scots and English, etc. In contrast to speakers of Hindi and Urdu who claim they cannot understand each other even when they can, speakers of the varieties of Arabic will claim they can all understand each other even when they cannot.[79]
While there is a minimum level of comprehension between all Arabic dialects, this level can increase or decrease based on geographic proximity: for example, Levantine and Gulf speakers understand each other much better than they do speakers from the Maghreb. The issue of diglossia between spoken and written language is a complicating factor: A single written form, differing sharply from any of the spoken varieties learned natively, unites several sometimes divergent spoken forms. For political reasons, Arabs mostly assert that they all speak a single language, despite mutual incomprehensibility among differing spoken versions.[80]
From a linguistic standpoint, it is often said that the various spoken varieties of Arabic differ among each other collectively about as much as the Romance languages.[81] This is an apt comparison in a number of ways. The period of divergence from a single spoken form is similar—perhaps 1500 years for Arabic, 2000 years for the Romance languages. Also, while it is comprehensible to people from the Maghreb, a linguistically innovative variety such as Moroccan Arabic is essentially incomprehensible to Arabs from the Mashriq, much as French is incomprehensible to Spanish or Italian speakers but relatively easily learned by them. This suggests that the spoken varieties may linguistically be considered separate languages.[citation needed]

Status in the Arab world vis-à-vis other languages
[edit]With the sole example of medieval linguist Abu Hayyan al-Gharnati – who, while a scholar of the Arabic language, was not ethnically Arab – medieval scholars of the Arabic language made no efforts at studying comparative linguistics, considering all other languages inferior.[82]
In modern times, the educated upper classes in the Arab world have taken a nearly opposite view. Yasir Suleiman wrote in 2011 that "studying and knowing English or French in most of the Middle East and North Africa have become a badge of sophistication and modernity and ... feigning, or asserting, weakness or lack of facility in Arabic is sometimes paraded as a sign of status, class, and perversely, even education through a mélange of code-switching practises."[83]
As a foreign language
[edit]Arabic has been taught worldwide in many elementary and secondary schools, especially Muslim schools. Universities around the world have classes that teach Arabic as part of their foreign languages, Middle Eastern studies, and religious studies courses. Arabic language schools exist to assist students to learn Arabic outside the academic world. There are many Arabic language schools in the Arab world and other Muslim countries. Because the Quran is written in Arabic and all Islamic terms are in Arabic, millions[84] of Muslims (both Arab and non-Arab) study the language.
Software and books with tapes are an important part of Arabic learning, as many of Arabic learners may live in places where there are no academic or Arabic language school classes available. Radio series of Arabic language classes are also provided from some radio stations.[85] A number of websites on the Internet provide online classes for all levels as a means of distance education; most teach Modern Standard Arabic, but some teach regional varieties from numerous countries.[86]
Vocabulary
[edit]Lexicography
[edit]Pre-modern Arabic lexicography
[edit]The tradition of Arabic lexicography extended for about a millennium before the modern period.[87] Early lexicographers (لُغَوِيُّون lughawiyyūn) sought to explain words in the Quran that were unfamiliar or had a particular contextual meaning, and to identify words of non-Arabic origin that appear in the Quran.[87] They gathered shawāhid (شَوَاهِد 'instances of attested usage') from poetry and the speech of the Arabs—particularly the Bedouin ʾaʿrāb (أَعْراب) who were perceived to speak the "purest," most eloquent form of Arabic—initiating a process of jamʿu‿l-luɣah (جمع اللغة 'compiling the language') which took place over the 8th and early 9th centuries.[87]

Kitāb al-'Ayn (c. 8th century), attributed to Al-Khalil ibn Ahmad al-Farahidi, is considered the first lexicon to include all Arabic roots; it sought to exhaust all possible root permutations—later called taqālīb (تقاليب)—calling those that are actually used mustaʿmal (مستعمَل) and those that are not used muhmal (مُهمَل).[87] Lisān al-ʿArab (1290) by Ibn Manzur gives 9,273 roots, while Tāj al-ʿArūs (1774) by Murtada az-Zabidi gives 11,978 roots.[87]
This lexicographic tradition was traditionalist and corrective in nature—holding that linguistic correctness and eloquence derive from Qurʾānic usage, pre-Islamic poetry, and Bedouin speech—positioning itself against laḥnu‿l-ʿāmmah (لَحْن العامة), the solecism it viewed as defective.[87]
Western lexicography of Arabic
[edit]In the second half of the 19th century, the British Arabist Edward William Lane, working with the Egyptian scholar Ibrāhīm Abd al-Ghaffār ad-Dasūqī,[88] compiled the Arabic–English Lexicon by translating material from earlier Arabic lexica into English.[89] The German Arabist Hans Wehr, with contributions from Hedwig Klein,[90] compiled the Arabisches Wörterbuch für die Schriftsprache der Gegenwart (1952), later translated into English as A Dictionary of Modern Written Arabic (1961), based on established usage, especially in literature.[91]
Modern Arabic lexicography
[edit]The Academy of the Arabic Language in Cairo sought to publish a historical dictionary of Arabic in the vein of the Oxford English Dictionary, tracing the changes of meanings and uses of Arabic words over time.[92] A first volume of Al-Muʿjam al-Kabīr was published in 1956 under the leadership of Taha Hussein.[93] The project is not yet complete; its 15th volume, covering the letter ṣād, was published in 2022.[94]
Loanwords
[edit]
The most important sources of borrowings into (pre-Islamic) Arabic are from the related (Semitic) languages Aramaic,[95] which used to be the principal, international language of communication throughout the ancient Near and Middle East, and Ethiopic. Many cultural, religious and political terms have entered Arabic from Iranian languages, notably Middle Persian, Parthian, and (Classical) Persian,[96] and Hellenistic Greek (kīmiyāʼ has as origin the Greek khymia, meaning in that language the melting of metals; see Roger Dachez, Histoire de la Médecine de l'Antiquité au XXe siècle, Tallandier, 2008, p. 251), alembic (distiller) from ambix (cup), almanac (climate) from almenichiakon (calendar).
For the origin of the last three borrowed words, see Alfred-Louis de Prémare, Foundations of Islam, Seuil, L'Univers Historique, 2002. Some Arabic borrowings from Semitic or Persian languages are, as presented in De Prémare's above-cited book: [citation needed]
- madīnah/medina (مدينة, city or city square), a word of Aramaic origin ܡܕ݂ܝܼܢ݇ܬܵܐ məḏī(n)ttā (in which it means "state/city").[citation needed]
- jazīrah (جزيرة), as in the well-known form الجزيرة "Al-Jazeera", means "island" and has its origin in the Syriac ܓܵܙܲܪܬܵܐ gāzartā.[citation needed]
- lāzaward (لازورد) is taken from Persian لاژورد lājvard, the name of a blue stone, lapis lazuli. This word was borrowed in several European languages to mean (light) blue – azure in English, azur in French and azul in Portuguese and Spanish.[citation needed]

A comprehensive overview of the influence of other languages on Arabic is found in Lucas & Manfredi (2020).[97]
Influence on other languages
[edit]The influence of Arabic has been most important in Islamic countries, because it is the language of the Islamic sacred book, the Quran. Arabic is also an important source of vocabulary for languages such as Amharic, Azerbaijani, Baluchi, Bengali, Berber, Bosnian, Chaldean, Chechen, Chittagonian, Croatian, Dagestani, Dhivehi, English, German, Gujarati, Hausa, Hindi, Kazakh, Kurdish, Kutchi, Kyrgyz, Malay (Malaysian and Indonesian), Pashto, Persian, Punjabi, Rohingya, Romance languages (French, Catalan, Italian, Portuguese, Sicilian, Spanish, etc.) Saraiki, Sindhi, Somali, Sylheti, Swahili, Tagalog, Tigrinya, Turkish, Turkmen, Urdu, Uyghur, Uzbek, Visayan and Wolof, as well as other languages in countries where these languages are spoken.[97] Modern Hebrew has been also influenced by Arabic especially during the process of revival, as MSA was used as a source for modern Hebrew vocabulary and roots.[98]
English has many Arabic loanwords, some directly, but most via other Mediterranean languages. Examples of such words include admiral, adobe, alchemy, alcohol, algebra, algorithm, alkaline, almanac, amber, arsenal, assassin, candy, carat, cipher, coffee, cotton, ghoul, hazard, jar, kismet, lemon, loofah, magazine, mattress, sherbet, sofa, sumac, tariff, and zenith.[99] Other languages such as Maltese[100] and Kinubi derive ultimately from Arabic, rather than merely borrowing vocabulary or grammatical rules.
Terms borrowed range from religious terminology (like Berber taẓallit, "prayer", from salat (صلاة ṣalāh)), academic terms (like Uyghur mentiq, "logic"), and economic items (like English coffee) to placeholders (like Spanish fulano, "so-and-so"), everyday terms (like Hindustani lekin, "but", or Spanish taza and French tasse, meaning "cup"), and expressions (like Catalan a betzef, "galore, in quantity"). Most Berber varieties (such as Kabyle), along with Swahili, borrow some numbers from Arabic. Most Islamic religious terms are direct borrowings from Arabic, such as صلاة (ṣalāh), "prayer", and إمام (imām), "prayer leader".[citation needed]
In languages not directly in contact with the Arab world, Arabic loanwords are often transferred indirectly via other languages rather than being transferred directly from Arabic. For example, most Arabic loanwords in Hindustani and Turkish entered through Persian. Older Arabic loanwords in Hausa were borrowed from Kanuri. Most Arabic loanwords in Yoruba entered through Hausa.[citation needed]
Arabic words made their way into several West African languages as Islam spread across the Sahara. Variants of Arabic words such as كتاب kitāb ("book") have spread to the languages of African groups who had no direct contact with Arab traders.[101]
Since, throughout the Islamic world, Arabic occupied a position similar to that of Latin in Europe, many of the Arabic concepts in the fields of science, philosophy, commerce, etc. were coined from Arabic roots by non-native Arabic speakers, notably by Aramaic and Persian translators, and then found their way into other languages. This process of using Arabic roots, especially in Kurdish and Persian, to translate foreign concepts continued through to the 18th and 19th centuries, when swaths of Arab-inhabited lands were under Ottoman rule.[citation needed]
Spoken varieties
[edit]
- 1: Hassaniyya
- 9: Saidi Arabic
- 10: Chadian Arabic
- 11: Sudanese Arabic
- 13: Najdi Arabic
- 14: Levantine Arabic
- 17: Gulf Arabic
- 18: Baharna Arabic
- 19: Hijazi Arabic
- 20: Shihhi Arabic
- 21: Omani Arabic
- 22: Dhofari Arabic
- 23: Sanaani Arabic
- 25: Hadrami Arabic
- 26: Uzbeki Arabic
- 27: Tajiki Arabic
- 28: Cypriot Arabic
- 29: Maltese
- 30: Nubi
- Sparsely populated area or no indigenous Arabic speakers
- Solid area fill: variety natively spoken by at least 25% of the population of that area or variety indigenous to that area only
- Hatched area fill: minority scattered over the area
- Dotted area fill: speakers of this variety are mixed with speakers of other Arabic varieties in the area
Colloquial Arabic is a collective term for the spoken dialects of Arabic used throughout the Arab world, which differ radically from the literary language. The main dialectal division is between the varieties within and outside of the Arabian peninsula, followed by that between sedentary varieties and the much more conservative Bedouin varieties. All the varieties outside of the Arabian peninsula, which include the large majority of speakers, have many features in common with each other that are not found in Classical Arabic. This has led researchers to postulate the existence of a prestige koine dialect in the one or two centuries immediately following the Arab conquest, whose features eventually spread to all newly conquered areas. These features are present to varying degrees inside the Arabian peninsula. Generally, the Arabian peninsula varieties have much more diversity than the non-peninsula varieties, but these have been understudied.[citation needed]

Within the non-peninsula varieties, the largest difference is between the non-Egyptian North African dialects, especially Moroccan Arabic, and the others. Moroccan Arabic in particular is hardly comprehensible to Arabic speakers east of Libya (although the converse is not true, in part due to the popularity of Egyptian films and other media).[citation needed]
One factor in the differentiation of the dialects is influence from the languages previously spoken in the areas, which have typically provided many new words and have sometimes also influenced pronunciation or word order. However, a more weighty factor for most dialects is, as among Romance languages, retention (or change of meaning) of different classical forms. Thus Iraqi aku, Levantine and Peninsular fīh and North African kayən all mean 'there is', and all come from Classical Arabic forms (yakūn, fīhi, kā'in respectively), but now sound very different.[citation needed]
Koiné
[edit]According to Charles A. Ferguson,[102] the following are some of the characteristic features of the koiné that underlies all the modern dialects outside the Arabian peninsula. Although many other features are common to most or all of these varieties, Ferguson believes that these features in particular are unlikely to have evolved independently more than once or twice and together suggest the existence of the koine:
- Loss of the dual number except on nouns, with consistent plural agreement (cf. feminine singular agreement in plural inanimates).
- Change of a to i in many affixes (e.g., non-past-tense prefixes ti- yi- ni-; wi- 'and'; il- 'the'; feminine -it in the construct state).
- Loss of third-weak verbs ending in w (which merge with verbs ending in y).
- Reformation of geminate verbs, e.g., ḥalaltu 'I untied' → ḥalēt(u).
- Conversion of separate words lī 'to me', laka 'to you', etc. into indirect-object clitic suffixes.
- Certain changes in the cardinal number system, e.g., khamsat ayyām 'five days' → kham(a)s tiyyām, where certain words have a special plural with prefixed t.
- Loss of the feminine elative (comparative).
- Adjective plurals of the form kibār 'big' → kubār.
- Change of nisba suffix -iyy > i.
- Certain lexical items, e.g., jāb 'bring' < jāʼa bi- 'come with'; shāf 'see'; ēsh 'what' (or similar) < ayyu shayʼ 'which thing'; illi (relative pronoun).
- Merger of /dˤ/ ⟨ض⟩ and /ðˤ/ ⟨ظ⟩ in most or all positions.
Dialect groups
[edit]- Egyptian Arabic, spoken by 67 million people in Egypt.[103] It is one of the most understood varieties of Arabic, due in large part to the widespread distribution of Egyptian films and television shows throughout the Arabic-speaking world.
- Levantine Arabic, spoken by about 44 million people in Lebanon, Syria, Jordan, Palestine, Israel, and Turkey.[104]
- Lebanese Arabic is a variety of Levantine Arabic spoken primarily in Lebanon.
- Jordanian Arabic is a continuum of mutually intelligible varieties of Levantine Arabic spoken by the population of the Kingdom of Jordan.
- Palestinian Arabic is a name of several dialects of the subgroup of Levantine Arabic spoken by the Palestinians in Palestine, by Arab citizens of Israel and in most Palestinian populations around the world.
- Samaritan Arabic, spoken by only several hundred in the Nablus region.
- Cypriot Maronite Arabic, spoken in Cyprus by around 9,800 people (2013 UNSD).[105]
- Maghrebi Arabic, also called "Darija", spoken by about 70 million people in Morocco, Algeria, Tunisia and Libya. It also forms the basis of Maltese via the extinct Sicilian Arabic dialect.[106] Maghrebi Arabic is very hard to understand for Arabic speakers from the Mashriq or Mesopotamia, the most comprehensible being Libyan Arabic and the most difficult Moroccan Arabic. The others such as Algerian Arabic can be considered in between the two in terms of difficulty.
- Libyan Arabic, spoken in Libya and neighboring countries.
- Tunisian Arabic, spoken in Tunisia and north-eastern Algeria.
- Algerian Arabic, spoken in Algeria.
- Judeo-Algerian Arabic was spoken by Jews in Algeria until 1962, now it is spoken by a few elderly Algerian Jews in France and Israel.
- Moroccan Arabic, spoken in Morocco.
- Hassaniya Arabic (3 million speakers), spoken in Mauritania, Western Sahara, some parts of the Azawad in northern Mali, southern Morocco, and south-western Algeria.
- Andalusian Arabic, spoken in Spain until the 16th century.
- Siculo-Arabic (Sicilian Arabic), was spoken in Sicily and Malta between the end of the 9th century and the end of the 12th century and eventually evolved into the Maltese language.
- Maltese, spoken on the island of Malta, is the only fully separate standardized language to have originated from an Arabic dialect, the extinct Siculo-Arabic dialect, with independent literary norms. Maltese has evolved independently of Modern Standard Arabic and its varieties into a standardized language over the past 800 years in a gradual process of Latinisation.[107][108] Maltese is therefore considered an exceptional descendant of Arabic that has no diglossic relationship with Standard Arabic or Classical Arabic.[109] Maltese is different from Arabic and other Semitic languages since its morphology has been deeply influenced by Romance languages, Italian and Sicilian.[110] It is the only Semitic language written in the Latin script. In terms of basic everyday language, speakers of Maltese are reported to be able to understand less than a third of what is said to them in Tunisian Arabic,[111] which is related to Siculo-Arabic,[106] whereas speakers of Tunisian are able to understand about 40% of what is said to them in Maltese.[112] This asymmetric intelligibility is considerably lower than the mutual intelligibility found between Maghrebi Arabic dialects.[113] Maltese has its own dialects, with urban varieties of Maltese being closer to Standard Maltese than rural varieties.[114]
- Mesopotamian Arabic, spoken by about 41.2 million people in Iraq (where it is called "Aamiyah"), eastern Syria and southwestern Iran (Khuzestan) and in the southeastern of Turkey (in the eastern Mediterranean, Southeastern Anatolia Region).
- North Mesopotamian Arabic is a spoken north of the Hamrin Mountains in Iraq, in western Iran, northern Syria, and in southeastern Turkey (in the eastern Mediterranean Region, Southeastern Anatolia Region, and southern Eastern Anatolia Region).[115]
- Judeo-Mesopotamian Arabic, also known as Iraqi Judeo Arabic and Yahudic, is a variety of Arabic spoken by Iraqi Jews of Mosul.
- Baghdad Arabic is the Arabic dialect spoken in Baghdad, and the surrounding cities and it is a subvariety of Mesopotamian Arabic.
- Baghdad Jewish Arabic is the dialect spoken by the Iraqi Jews of Baghdad.
- South Mesopotamian Arabic (Basrawi dialect) is the dialect spoken in southern Iraq, such as Basra, Dhi Qar, and Najaf.[116]
- Khuzestani Arabic, spoken in the Iranian province of Khuzestan. This is a mix of Southern Mesopotamian Arabic and Gulf Arabic.
- Khorasani Arabic, spoken in the Iranian province of Khorasan.
- Kuwaiti Arabic is a Gulf Arabic dialect spoken in Kuwait.
- Sudanese Arabic, spoken by 17 million people in Sudan and some parts of southern Egypt. Sudanese Arabic is quite distinct from the dialect of its neighbor to the north; rather, the Sudanese have a dialect similar to the Hejazi dialect.
- Juba Arabic, spoken in South Sudan and southern far Sudan.
- Gulf Arabic, spoken by around four million people, predominantly in Kuwait, Bahrain, some parts of Oman, eastern Saudi Arabia coastal areas and some parts of UAE and Qatar. Also spoken in Iran's Bushehr and Hormozgan provinces. Although Gulf Arabic is spoken in Qatar, most Qatari citizens speak Najdi Arabic (Bedawi).
- Omani Arabic, distinct from the Gulf Arabic of Eastern Arabia and Bahrain, spoken in Central Oman. With its oil wealth and mobility it has spread to various areas of the former Sultanate of Muscat, especially Zanzibar and the Swahili Coast.
- Hadhrami Arabic, spoken by around 8 million people, predominantly in Hadhramaut, and in parts of the Arabian Peninsula, South and Southeast Asia, and East Africa by Hadhrami descendants.
- Indonesian Arabic, spoken in Arab ethnic enclaves in Indonesia, especially along the north coast of Java. It has about 60,000 speakers according to a rough estimate in 2010.[117]
- Yemeni Arabic, spoken in Yemen, and southern Saudi Arabia by 15 million people. Similar to Gulf Arabic.
- Najdi Arabic, spoken by around 10 million people, mainly spoken in Najd, central and northern Saudi Arabia. Most Qatari citizens speak Najdi Arabic (Bedawi).
- Hejazi Arabic (6 million speakers), spoken in Hejaz, western Saudi Arabia.
- Saharan Arabic spoken in some parts of Algeria, Niger and Mali.
- Baharna Arabic (800,000 speakers), spoken by Bahrani Shias in Bahrain and Qatif, the dialect exhibits many big differences from Gulf Arabic. It is also spoken to a lesser extent in Oman.
- Judeo-Arabic dialects – these are the dialects spoken by the Jews that had lived or continue to live in the Arab World. As Jewish migration to Israel took hold, the language did not thrive and is now considered endangered. So-called Qəltu Arabic.
- Chadian Arabic, spoken in Chad, Sudan, some parts of South Sudan, Central African Republic, Niger, Nigeria, and Cameroon.
- Central Asian Arabic, spoken in Uzbekistan, Tajikistan, and Afghanistan by around 8,000 people.[118][119] Tajiki Arabic is highly endangered.[120]
- Shirvani Arabic, spoken in Azerbaijan and Dagestan until the 1930s, now extinct.
Phonology
[edit]While many languages have numerous dialects that differ in phonology, contemporary spoken Arabic is more properly described as a continuum of varieties.[121] Modern Standard Arabic (MSA), is the standard variety shared by educated speakers throughout Arabic-speaking regions. MSA is used in writing in formal print media and orally in newscasts, speeches and formal declarations of numerous types.[122]
Modern Standard Arabic has 28 consonant phonemes and 6 vowel phonemes. The four "emphatic" (pharyngealized) consonants /sˤ, dˤ, tˤ, ðˤ/ contrast with their non-emphatic counterparts /s, d, t, ð/, other consonants including the interdentals /θ, ð/, and the pharyngeals /ħ, ʕ/ are considered rare cross-linguistically. Some of these phonemes have coalesced in the various modern dialects, while new phonemes have been introduced through borrowing or phonemic splits. A "phonemic quality of length" applies to consonants as well as vowels.[123]
Grammar
[edit]
The grammar of Arabic has similarities with the grammar of other Semitic languages. Some of the typical differences between Standard Arabic (فُصْحَى) and vernacular varieties are a loss of morphological markings of grammatical case, changes in word order, a shift toward more analytic morphosyntax, loss of grammatical mood, and loss of the inflected passive voice.
Literary Arabic
[edit]As in other Semitic languages, Arabic has a complex and unusual morphology, i.e. method of constructing words from a basic root. Arabic has a nonconcatenative "root-and-pattern" morphology: A root consists of a set of bare consonants (usually three), which are fitted into a discontinuous pattern to form words. For example, the word for 'I wrote' is constructed by combining the root k-t-b 'write' with the pattern -a-a-tu 'I Xed' to form katabtu 'I wrote'.
Other verbs meaning 'I Xed' will typically have the same pattern but with different consonants, e.g. qaraʼtu 'I read', akaltu 'I ate', dhahabtu 'I went', although other patterns are possible, e.g. sharibtu 'I drank', qultu 'I said', takallamtu 'I spoke', where the subpattern used to signal the past tense may change but the suffix -tu is always used.
From a single root k-t-b, numerous words can be formed by applying different patterns:
- كَتَبْتُ katabtu 'I wrote'
- كَتَّبْتُ kattabtu 'I had (something) written'
- كَاتَبْتُ kātabtu 'I corresponded (with someone)'
- أَكْتَبْتُ 'aktabtu 'I dictated'
- اِكْتَتَبْتُ iktatabtu 'I subscribed'
- تَكَاتَبْنَا takātabnā 'we corresponded with each other'
- أَكْتُبُ 'aktubu 'I write'
- أُكَتِّبُ 'ukattibu 'I have (something) written'
- أُكَاتِبُ 'ukātibu 'I correspond (with someone)'
- أُكْتِبُ 'uktibu 'I dictate'
- أَكْتَتِبُ 'aktatibu 'I subscribe'
- نَتَكَتِبُ natakātabu 'we correspond each other'
- كُتِبَ kutiba 'it was written'
- أُكْتِبَ 'uktiba 'it was dictated'
- مَكْتُوبٌ maktūbun 'written'
- مُكْتَبٌ muktabun 'dictated'
- كِتَابٌ kitābun 'book'
- كُتُبٌ kutubun 'books'
- كَاتِبٌ kātibun 'writer'
- كُتَّابٌ kuttābun 'writers'
- مَكْتَبٌ maktabun 'desk, office'
- مَكْتَبَةٌ maktabatun 'library, bookshop'
- etc.
Nouns and adjectives
[edit]Nouns in Literary Arabic have three grammatical cases (nominative, accusative, and genitive [also used when the noun is governed by a preposition]); three numbers (singular, dual and plural); two genders (masculine and feminine); and three "states" (indefinite, definite, and construct). The cases of singular nouns, other than those that end in long ā, are indicated by suffixed short vowels (/-u/ for nominative, /-a/ for accusative, /-i/ for genitive).
The feminine singular is often marked by ـَة /-at/, which is pronounced as /-ah/ before a pause. Plural is indicated either through endings (the sound plural) or internal modification (the broken plural). Definite nouns include all proper nouns, all nouns in "construct state" and all nouns which are prefixed by the definite article اَلْـ /al-/. Indefinite singular nouns, other than those that end in long ā, add a final /-n/ to the case-marking vowels, giving /-un/, /-an/ or /-in/, which is also referred to as nunation or tanwīn.
Adjectives in Literary Arabic are marked for case, number, gender and state, as for nouns. The plural of all non-human nouns is always combined with a singular feminine adjective, which takes the ـَة /-at/ suffix.
Pronouns in Literary Arabic are marked for person, number and gender. There are two varieties, independent pronouns and enclitics. Enclitic pronouns are attached to the end of a verb, noun or preposition and indicate verbal and prepositional objects or possession of nouns. The first-person singular pronoun has a different enclitic form used for verbs (ـنِي /-nī/) and for nouns or prepositions (ـِي /-ī/ after consonants, ـيَ /-ya/ after vowels).
Nouns, verbs, pronouns and adjectives agree with each other in all respects. Non-human plural nouns are grammatically considered to be feminine singular. A verb in a verb-initial sentence is marked as singular regardless of its semantic number when the subject of the verb is explicitly mentioned as a noun. Numerals between three and ten show "chiasmic" agreement, in that grammatically masculine numerals have feminine marking and vice versa.
Verbs
[edit]Verbs in Literary Arabic are marked for person (first, second, or third), gender, and number. They are conjugated in two major paradigms (past and non-past); two voices (active and passive); and six moods (indicative, imperative, subjunctive, jussive, shorter energetic and longer energetic); the fifth and sixth moods, the energetics, exist only in Classical Arabic but not in MSA.[124] There are two participles, active and passive, and a verbal noun, but no infinitive.
The past and non-past paradigms are sometimes termed perfective and imperfective, indicating the fact that they actually represent a combination of tense and aspect. The moods other than the indicative occur only in the non-past, and the future tense is signaled by prefixing سَـ sa- or سَوْفَ sawfa onto the non-past. The past and non-past differ in the form of the stem (e.g., past كَتَبـ katab- vs. non-past ـكْتُبـ -ktub-), and use completely different sets of affixes for indicating person, number and gender: In the past, the person, number and gender are fused into a single suffixal morpheme, while in the non-past, a combination of prefixes (primarily encoding person) and suffixes (primarily encoding gender and number) are used. The passive voice uses the same person/number/gender affixes but changes the vowels of the stem.
The following shows a paradigm of a regular Arabic verb, كَتَبَ kataba 'to write'. In Modern Standard, the energetic mood, in either long or short form, which has the same meaning, is almost never used.
Derivation
[edit]Like other Semitic languages, and unlike most other languages, Arabic makes much more use of nonconcatenative morphology, applying many templates applied to roots, to derive words than adding prefixes or suffixes to words.
For verbs, a given root can occur in many different derived verb stems, of which there are about fifteen, each with one or more characteristic meanings and each with its own templates for the past and non-past stems, active and passive participles, and verbal noun. These are referred to by Western scholars as "Form I", "Form II", and so on through "Form XV", although Forms XI to XV are rare.
These stems encode grammatical functions such as the causative, intensive and reflexive. Stems sharing the same root consonants represent separate verbs, albeit often semantically related, and each is the basis for its own conjugational paradigm. As a result, these derived stems are part of the system of derivational morphology, not part of the inflectional system.
Examples of the different verbs formed from the root كتب k-t-b 'write' (using حمر ḥ-m-r 'red' for Form IX, which is limited to colors and physical defects):
| Form | Past | Meaning | Non-past | Meaning |
|---|---|---|---|---|
| I | kataba | 'he wrote' | yaktubu | 'he writes' |
| II | kattaba | 'he made (someone) write' | yukattibu | "he makes (someone) write" |
| III | kātaba | 'he corresponded with, wrote to (someone)' | yukātibu | 'he corresponds with, writes to (someone)' |
| IV | ʾaktaba | 'he dictated' | yuktibu | 'he dictates' |
| V | takattaba | nonexistent | yatakattabu | nonexistent |
| VI | takātaba | 'he corresponded (with someone, esp. mutually)' | yatakātabu | 'he corresponds (with someone, esp. mutually)' |
| VII | inkataba | 'he subscribed' | yankatibu | 'he subscribes' |
| VIII | iktataba | 'he copied' | yaktatibu | 'he copies' |
| IX | iḥmarra | 'he turned red' | yaḥmarru | 'he turns red' |
| X | istaktaba | 'he asked (someone) to write' | yastaktibu | 'he asks (someone) to write' |
Form II is sometimes used to create transitive denominative verbs (verbs built from nouns); Form V is the equivalent used for intransitive denominatives.
The associated participles and verbal nouns of a verb are the primary means of forming new lexical nouns in Arabic. This is similar to the process by which, for example, the English gerund "meeting" (similar to a verbal noun) has turned into a noun referring to a particular type of social, often work-related event where people gather together to have a "discussion" (another lexicalized verbal noun). Another fairly common means of forming nouns is through one of a limited number of patterns that can be applied directly to roots, such as the "nouns of location" in ma- (e.g. maktab 'desk, office' < k-t-b 'write', maṭbakh 'kitchen' < ṭ-b-kh 'cook').
The only three genuine suffixes are as follows:
- The feminine suffix -ah; variously derives terms for women from related terms for men, or more generally terms along the same lines as the corresponding masculine, e.g. maktabah 'library' (also a writing-related place, but different from maktab, as above).
- The nisbah suffix -iyy-. This suffix is extremely productive, and forms adjectives meaning "related to X". It corresponds to English adjectives in -ic, -al, -an, -y, -ist, etc.
- The feminine nisbah suffix -iyyah. This is formed by adding the feminine suffix -ah onto nisba adjectives to form abstract nouns. For example, from the basic root š-r-k 'share' can be derived the Form VIII verb ishtaraka 'to cooperate, participate', and in turn its verbal noun ištirāk 'cooperation, participation' can be formed. This in turn can be made into a nisbah adjective ištirākiyy 'socialist', from which an abstract noun ishtirākiyyah 'socialism' can be derived. Other recent formations are jumhūriyyah 'republic' (lit. "public-ness", < jumhūr 'multitude, general public'), and the Gaddafi-specific variation jamāhīriyyah 'people's republic' (lit. "masses-ness", < jamāhīr 'the masses', pl. of jumhūr, as above).
Colloquial varieties
[edit]The spoken dialects have lost the case distinctions and make only limited use of the dual (it occurs only on nouns and its use is no longer required in all circumstances). They have lost the mood distinctions other than imperative, but many have since gained new moods through the use of prefixes (most often /bi-/ for indicative vs. unmarked subjunctive). They have also mostly lost the indefinite "nunation" and the internal passive.
The following is an example of a regular verb paradigm in Egyptian Arabic.
| Tense/Mood | Past | Present Subjunctive | Present Indicative | Future | Imperative | |
|---|---|---|---|---|---|---|
| Singular | ||||||
| 1st | katáb-t | á-ktib | bá-ktib | ḥá-ktib | " | |
| 2nd | masculine | katáb-t | tí-ktib | bi-tí-ktib | ḥa-tí-ktib | í-ktib |
| feminine | katáb-ti | ti-ktíb-i | bi-ti-ktíb-i | ḥa-ti-ktíb-i | i-ktíb-i | |
| 3rd | masculine | kátab | yí-ktib | bi-yí-ktib | ḥa-yí-ktib | " |
| feminine | kátab-it | tí-ktib | bi-tí-ktib | ḥa-tí-ktib | ||
| Plural | ||||||
| 1st | katáb-na | ní-ktib | bi-ní-ktib | ḥá-ní-ktib | " | |
| 2nd | katáb-tu | ti-ktíb-u | bi-ti-ktíb-u | ḥa-ti-ktíb-u | i-ktíb-u | |
| 3rd | kátab-u | yi-ktíb-u | bi-yi-ktíb-u | ḥa-yi-ktíb-u | " | |
Writing system
[edit]
The Arabic alphabet derives from the Aramaic through Nabatean, to which it bears a loose resemblance like that of Coptic or Cyrillic scripts to Greek script. Traditionally, there were several differences between the Western (North African) and Middle Eastern versions of the alphabet—in particular, the faʼ had a dot underneath and qaf a single dot above in the Maghreb, and the order of the letters was slightly different (at least when they were used as numerals).
However, the old Maghrebi variant has been abandoned except for calligraphic purposes in the Maghreb itself, and remains in use mainly in the Quranic schools (zaouias) of West Africa. Arabic, like all other Semitic languages (except for the Latin-written Maltese, and the languages with the Ge'ez script), is written from right to left. There are several styles of scripts such as thuluth, muhaqqaq, tawqi, rayhan, and notably naskh, which is used in print and by computers, and ruqʻah, which is commonly used for correspondence.[125][126]
Originally Arabic was made up of only rasm without diacritical marks[127] Later diacritical points (which in Arabic are referred to as nuqaṯ) were added (which allowed readers to distinguish between letters such as b, t, th, n and y). Finally signs known as Tashkil were used for short vowels known as harakat and other uses such as final postnasalized or long vowels.
| Arabic Alphabet | ||||||
|---|---|---|---|---|---|---|
| Wikipedia
Romanization |
Value in MSA
(IPA) |
Contextual forms | Isolated form | No. | ||
| Final | Medial | Initial | ||||
| ā | /aː/ | ـا | ا | 1 | ||
| b | /b/ | ـب | ـبـ | بـ | ب | 2 |
| t | /t/ | ـت | ـتـ | تـ | ت | 3 |
| ṯ or th | /θ/ | ـث | ـثـ | ثـ | ث | 4 |
| j | /d͡ʒ/* | ـج | ـجـ | جـ | ج | 5 |
| ḥ | /ħ/ | ـح | ـحـ | حـ | ح | 6 |
| ḵ or kh | /x/ | ـخ | ـخـ | خـ | خ | 7 |
| d | /d/ | ـد | د | 8 | ||
| ḏ or dh | /ð/ | ـذ | ذ | 9 | ||
| r | /r/ | ـر | ر | 10 | ||
| z | /z/ | ـز | ز | 11 | ||
| s | /s/ | ـس | ـسـ | سـ | س | 12 |
| š or sh | /ʃ/ | ـش | ـشـ | شـ | ش | 13 |
| ṣ | /sˤ/ | ـص | ـصـ | صـ | ص | 14 |
| ḍ | /dˤ/ | ـض | ـضـ | ضـ | ض | 15 |
| ṭ | /tˤ/ | ـط | ـطـ | طـ | ط | 16 |
| ẓ | /ðˤ/ | ـظ | ـظـ | ظـ | ظ | 17 |
| ʻ or ʕ | /ʕ/ | ـع | ـعـ | عـ | ع | 18 |
| ḡ or gh | /ɣ/ | ـغ | ـغـ | غـ | غ | 19 |
| f | /f/ | ـف | ـفـ | فـ | ف | 20 |
| q | /q/ | ـق | ـقـ | قـ | ق | 21 |
| k | /k/ | ـك | ـكـ | كـ | ك | 22 |
| l | /l/ | ـل | ـلـ | لـ | ل | 23 |
| m | /m/ | ـم | ـمـ | مـ | م | 24 |
| n | /n/ | ـن | ـنـ | نـ | ن | 25 |
| h | /h/ | ـه | ـهـ | هـ | ﻩ | 26 |
| w and ū | /w/, /uː/ | ـو | و | 27 | ||
| y and ī | /j/, /iː/ | ـي | ـيـ | يـ | ي | 28 |
| ʾ or ʔ | /ʔ/ | ء | - | |||
Notes:
- Modern Standard Arabic (Literary Arabic) ⟨ج⟩ can be pronounced /d͡ʒ/ or /ʒ/ (or /g/ only in Egypt) depending on the speaker's regional dialect.
- The Hamza ⟨ء⟩ can be considered a letter and plays an important role in Arabic spelling but it is not considered part of the alphabet, it has different written forms depending on its position in the word, check Hamza.
Calligraphy
[edit]After Khalil ibn Ahmad al Farahidi finally fixed the Arabic script around 786, many styles were developed, both for the writing down of the Quran and other books, and for inscriptions on monuments as decoration.
Arabic calligraphy has not fallen out of use as calligraphy has in the Western world, and is still considered by Arabs as a major art form; calligraphers are held in great esteem. Being cursive by nature, unlike the Latin script, Arabic script is used to write down a verse of the Quran, a hadith, or a proverb. The composition is often abstract, but sometimes the writing is shaped into an actual form such as that of an animal. One of the current masters of the genre is Hassan Massoudy.[128]
In modern times the intrinsically calligraphic nature of the written Arabic form is haunted by the thought that a typographic approach to the language, necessary for digitized unification, will not always accurately maintain meanings conveyed through calligraphy.[129]
Romanization
[edit]There are a number of different standards for the romanization of Arabic, i.e. methods of accurately and efficiently representing Arabic with the Latin script. There are various conflicting motivations involved, which leads to multiple systems. Some are interested in transliteration, i.e. representing the spelling of Arabic, while others focus on transcription, i.e. representing the pronunciation of Arabic. (They differ in that, for example, the same letter ي is used to represent both a consonant, as in "you" or "yet", and a vowel, as in "me" or "eat".)
Some systems, e.g. for scholarly use, are intended to accurately and unambiguously represent the phonemes of Arabic, generally making the phonetics more explicit than the original word in the Arabic script. These systems are heavily reliant on diacritical marks such as "š" for the sound equivalently written sh in English. Other systems (e.g. the Bahá'í orthography) are intended to help readers who are neither Arabic speakers nor linguists with intuitive pronunciation of Arabic names and phrases.[citation needed]
These less "scientific" systems tend to avoid diacritics and use digraphs (like sh and kh). These are usually simpler to read, but sacrifice the definiteness of the scientific systems, and may lead to ambiguities, e.g. whether to interpret sh as a single sound, as in gash, or a combination of two sounds, as in gashouse. The ALA-LC romanization solves this problem by separating the two sounds with a prime symbol ( ′ ); e.g., as′hal 'easier'.
During the last few decades and especially since the 1990s, Western-invented text communication technologies have become prevalent in the Arab world, such as personal computers, the World Wide Web, email, bulletin board systems, IRC, instant messaging and mobile phone text messaging. Most of these technologies originally had the ability to communicate using the Latin script only, and some of them still do not have the Arabic script as an optional feature. As a result, Arabic speaking users communicated in these technologies by transliterating the Arabic text using the Latin script.
To handle those Arabic letters that cannot be accurately represented using the Latin script, numerals and other characters were appropriated. For example, the numeral "3" may be used to represent the Arabic letter ⟨ع⟩. There is no universal name for this type of transliteration, but some have named it Arabic Chat Alphabet or IM Arabic. Other systems of transliteration exist, such as using dots or capitalization to represent the "emphatic" counterparts of certain consonants. For instance, using capitalization, the letter ⟨د⟩, may be represented by d. Its emphatic counterpart, ⟨ض⟩, may be written as D.
Numerals
[edit]In most of present-day North Africa, the Western Arabic numerals (0, 1, 2, 3, 4, 5, 6, 7, 8, 9) are used. However, in Egypt and Arabic-speaking countries to the east of it, the Eastern Arabic numerals (٠ – ١ – ٢ – ٣ – ٤ – ٥ – ٦ – ٧ – ٨ – ٩) are in use. When representing a number in Arabic, the lowest-valued position is placed on the right, so the order of positions is the same as in left-to-right scripts. Sequences of digits such as telephone numbers are read from left to right, but numbers are spoken in the traditional Arabic fashion, with units and tens reversed from the modern English usage. For example, 24 is said "four and twenty" just like in the German language (vierundzwanzig) and Classical Hebrew, and 1975 is said "a thousand and nine-hundred and five and seventy" or, more eloquently, "a thousand and nine-hundred five seventy".
Arabic alphabet and nationalism
[edit]There have been many instances of national movements to convert Arabic script into Latin script or to Romanize the language. Currently, the only Arabic variety to use Latin script is Maltese.
Lebanon
[edit]The Beirut newspaper La Syrie pushed for the change from Arabic script to Latin letters in 1922. The major head of this movement was Louis Massignon, a French Orientalist, who brought his concern before the Arabic Language Academy in Damascus in 1928. Massignon's attempt at Romanization failed as the academy and population viewed the proposal as an attempt from the Western world to take over their country. Sa'id Afghani, a member of the academy, mentioned that the movement to Romanize the script was a Zionist plan to dominate Lebanon.[130][131] Said Akl created a Latin-based alphabet for Lebanese and used it in a newspaper he founded, Lebnaan, as well as in some books he wrote.
Egypt
[edit]After the period of colonialism in Egypt, Egyptians were looking for a way to reclaim and re-emphasize Egyptian culture. As a result, some Egyptians pushed for an Egyptianization of the Arabic language in which the formal Arabic and the colloquial Arabic would be combined into one language and the Latin alphabet would be used.[130][131] There was also the idea of finding a way to use Hieroglyphics instead of the Latin alphabet, but this was seen as too complicated to use.[130][131]
A scholar, Salama Musa agreed with the idea of applying a Latin alphabet to Arabic, as he believed that would allow Egypt to have a closer relationship with the West. He also believed that Latin script was key to the success of Egypt as it would allow for more advances in science and technology. This change in alphabet, he believed, would solve the problems inherent with Arabic, such as a lack of written vowels and difficulties writing foreign words that made it difficult for non-native speakers to learn.[130][131] Ahmad Lutfi As Sayid and Muhammad Azmi, two Egyptian intellectuals, agreed with Musa and supported the push for Romanization.[130][132]
The idea that Romanization was necessary for modernization and growth in Egypt continued with Abd Al-Aziz Fahmi in 1944. He was the chairman for the Writing and Grammar Committee for the Arabic Language Academy of Cairo.[130][132] This effort failed as the Egyptian people felt a strong cultural tie to the Arabic alphabet.[130][132] In particular, the older Egyptian generations believed that the Arabic alphabet had strong connections to Arab values and history, due to the long history of the Arabic alphabet (Shrivtiel, 189) in Muslim societies.
Sample text
[edit]| Modern Standard Arabic, Arabic script[133] | ALA-LC transliteration | English[134] |
|---|---|---|
يولد جميع الناس أحراراً متساوين في الكرامة والحقوق، وقد وهبوا عقلاً وضميراً وعليهم أن يعامل بعضهم بعضاً بروح الإخاء.
|
Yūlad jamīʻ al-nās aḥrār-an mutasāwīn fil-karāma-ti wal-huqūq-i, wa-qad wuhibū ʻaql-an wa-ḍamīr-an wa-ʻalayhim an yuʻāmil-u baʻduhum baʻd-an bi-rūh al-ikhāʼ-i. | All human beings are born free and equal in dignity and rights. They are endowed with reason and conscience and should act towards one another in a spirit of brotherhood. |
See also
[edit]- Arabic Ontology
- Arabic diglossia
- Arabic language influence on the Spanish language
- Arabic Language International Council
- Arabic literature
- Arabic–English Lexicon
- Arabist
- A Dictionary of Modern Written Arabic
- Glossary of Islam
- International Association of Arabic Dialectology
- List of Arab newspapers
- List of Arabic-language television channels
- List of Arabic given names
- List of countries where Arabic is an official language
- Arabic-based creole languages
- Varieties of Arabic
- List of French words of Arabic origin
- Replacement of loanwords in Turkish
Notes
[edit]- ^ The constitution of the Islamic Republic of Iran recognizes the Arabic language as the language of Islam, giving it a formal status as the language of religion, and regulates its spreading within the Iranian national curriculum. The constitution declares in Chapter II: (The Official Language, Script, Calendar, and Flag of the Country) in Article 16 "Since the language of the Qur`an and Islamic texts and teachings is Arabic, ..., it must be taught after elementary level, in all classes of secondary school and in all areas of study."[4]
- ^ The constitution of the Islamic Republic of Pakistan states in Article 31 No. 2 that "The State shall endeavour, as respects the Muslims of Pakistan (a) to make the teaching of the Holy Quran and Islamiat compulsory, to encourage and facilitate the learning of Arabic language ..."[5]
- ^ endonym: اَلْعَرَبِيَّةُ, romanized: al-ʿarabiyyah, pronounced [al ʕaraˈbijːa] ⓘ, or عَرَبِيّ, ʿarabīy, pronounced [ˈʕarabiː] ⓘ or [ʕaraˈbij]
References
[edit]Citations
[edit]- ^ a b Arabic at Ethnologue (28th ed., 2025)
- ^ Arabic at Ethnologue (28th ed., 2025)
- ^ "Eritrea", The World Factbook, Central Intelligence Agency, 26 April 2023, retrieved 29 April 2023
- ^ Constitution of the Islamic Republic of Iran: Iran (Islamic Republic of)'s Constitution of 1979. – Article: 16 Official or national languages, 1979, retrieved 25 July 2018
- ^ Constitution of Pakistan: Constitution of Pakistan, 1973 – Article: 31 Islamic way of life, 1973, retrieved 13 June 2018
- ^ "Implementation of the Charter in Cyprus". Database for the European Charter for Regional or Minority Languages. Public Foundation for European Comparative Minority Research. Archived from the original on 24 October 2011. Retrieved 20 May 2013.
- ^ "Basic Law: Israel – The Nation State of the Jewish People" (PDF). Knesset. 19 July 2018. Archived (PDF) from the original on 10 April 2021. Retrieved 13 January 2021.
- ^ "Mali". www.axl.cefan.ulaval.ca. Retrieved 29 April 2023.
- ^ "Niger : Loi n° 2001-037 du 31 décembre 2001 fixant les modalités de promotion et de développement des langues nationales". www.axl.cefan.ulaval.ca (in French). Retrieved 29 April 2023.
- ^ Constitution of the Philippines, Article XIV, Sec 7: For purposes of communication and instruction, the official languages of the Philippines are Filipino and, until otherwise provided by law, English. The regional languages are the auxiliary official languages in the regions and shall serve as auxiliary media of instruction therein. Spanish and Arabic shall be promoted on a voluntary and optional basis.
- ^ a b "Decret n° 2005-980 du 21 octobre 2005". Archived from the original on 18 May 2015. Retrieved 10 December 2021.
- ^ The Constitution of the Republic of South Africa (PDF) (2013 English version ed.). Constitutional Court of South Africa. 2013. ch. 1, s. 6. Archived (PDF) from the original on 23 August 2018. Retrieved 17 April 2020.
- ^ a b c d e Al-Jallad, Ahmad (16 November 2015). "Al-Jallad. The earliest stages of Arabic and its linguistic classification". Routledge Handbook of Arabic Linguistics, forthcoming. p. 315. ISBN 9781315147062. Archived from the original on 23 October 2017. Retrieved 15 July 2016.
- ^ "Documentation for ISO 639 identifier: ara". Archived from the original on 3 March 2016. Retrieved 20 March 2018.
- ^ Kamusella, Tomasz (2017). "The Arabic Language: A Latin of Modernity?" (PDF). Journal of Nationalism, Memory & Language Politics. 11 (2): 117–145. doi:10.1515/jnmlp-2017-0006. hdl:10023/12443. ISSN 2570-5857. S2CID 158624482. Archived (PDF) from the original on 12 December 2019. Retrieved 28 June 2019.
- ^ Wright (2001:492)
- ^ a b "What are the official languages of the United Nations? - Ask DAG!". ask.un.org. Archived from the original on 5 February 2016. Retrieved 21 December 2019.
- ^ a b World, I. H. "Arabic". IH World. Retrieved 7 July 2021.
- ^ "Maltese language". Encyclopedia Britannica. Archived from the original on 24 September 2019. Retrieved 21 December 2019.
- ^ Versteegh, Kees; Versteegh, C. H. M. (1997). The Arabic Language. Columbia University Press. ISBN 9780231111522.
... of the Qufdn; many Arabic loanwords in the indigenous languages, as in Urdu and Indonesian, were introduced mainly through the medium of Persian.
- ^ Bhabani Charan Ray (1981). "Appendix B Persian, Turkish, Arabic words generally used in Oriya". Orissa Under the Mughals: From Akbar to Alivardi : a Fascinating Study of the Socio-economic and Cultural History of Orissa. Orissan studies project, 10. Calcutta: Punthi Pustak. p. 213. OCLC 461886299.
- ^ Lane, James (2 June 2021). "The 10 Most Spoken Languages In The World". Babbel. Retrieved 29 June 2021.
- ^ "Internet: most common languages online 2020". Statista. Retrieved 26 November 2021.
- ^ "Top Ten Internet Languages in The World - Internet Statistics". www.internetworldstats.com. Archived from the original on 7 September 2019. Retrieved 26 November 2021.
- ^ "Mandarin Chinese Most Useful Business Language After English - Bloomberg Business". Bloomberg News. 29 March 2015. Archived from the original on 29 March 2015. Retrieved 2 January 2022.
- ^ Versteegh, Cornelis Henricus Maria "Kees" (1997). The Arabic Language. Columbia University Press. p. 13. ISBN 978-0-231-11152-2.
- ^ a b c d Semitic languages: an international handbook / edited by Stefan Weninger; in collaboration with Geoffrey Khan, Michael P. Streck, Janet C. E.Watson; Walter de Gruyter GmbH & Co. KG, Berlin/Boston, 2011.
- ^ a b Al-Jallad 2020a, p. 8.
- ^ Huehnergard, John (2017). "Arabic in Its Semitic Context". In Al-Jallad, Ahmad (ed.). Arabic in Context: Celebrating 400 Years of Arabic at Leiden University. Brill. p. 13. doi:10.1163/9789004343047_002. ISBN 978-90-04-34304-7. OCLC 967854618.
- ^ Al-Jallad, Ahmad (2015). An Outline of the Grammar of the Safaitic Inscriptions. Brill. ISBN 978-90-04-28982-6. Archived from the original on 23 July 2016. Retrieved 17 July 2016.
- ^ Birnstiel 2019, p. 368.
- ^ Al-Jallad, Ahmad (2021). "Connecting the Lines between Old (Epigraphic) Arabic and the Modern Vernaculars". Languages. 6 (4): 1. doi:10.3390/languages6040173. ISSN 2226-471X.
- ^ Versteegh 2014, p. 172.
- ^ Macdonald, Michael C. A. "Arabians, Arabias, and the Greeks_Contact and Perceptions". Literacy and Identity in Pre-Islamic Arabia. pp. 15–17. ISBN 9781003278818. Archived from the original on 27 June 2023. Retrieved 5 May 2020.
- ^ Al-Jallad, Ahmad (January 2014). "Al-Jallad. 2014. On the genetic background of the Rbbl bn Hfʿm grave inscription at Qaryat al-Fāw". BSOAS. 77 (3): 445–465. doi:10.1017/S0041977X14000524.
- ^ Al-Jallad, Ahmad. "Al-Jallad (Draft) Remarks on the classification of the languages of North Arabia in the 2nd edition of The Semitic Languages (eds. J. Huehnergard and N. Pat-El)".[permanent dead link]
- ^ Al-Jallad, Ahmad. "One wāw to rule them all: the origins and fate of wawation in Arabic and its orthography".
- ^ Nehmé, Laila (January 2010). ""A glimpse of the development of the Nabataean script into Arabic based on old and new epigraphic material", in M.C.A. Macdonald (ed), The development of Arabic as a written language (Supplement to the Proceedings of the Seminar for Arabian Studies, 40). Oxford: 47–88". Supplement to the Proceedings of the Seminar for Arabian Studies.
- ^ Lentin, Jérôme (30 May 2011). "Middle Arabic". Encyclopedia of Arabic Language and Linguistics. Brill Reference. Archived from the original on 15 August 2016. Retrieved 17 July 2016.
- ^ Team, Almaany. "ترجمة و معنى نحو بالإنجليزي في قاموس المعاني. قاموس عربي انجليزي مصطلحات صفحة 1". www.almaany.com. Retrieved 26 May 2021.
- ^ Leaman, Oliver (2006). The Qur'an: An Encyclopedia. Taylor & Francis. ISBN 978-0-415-32639-1.
- ^ "Al-Khalīl ibn Aḥmad | Arab philologist". Encyclopedia Britannica. Retrieved 27 May 2021.
- ^ a b Versteegh, Kees (1997). "Ibn Maḍâ' and the refutation of the grammarians". Landmarks in linguistic thought III. Abingdon, UK: Taylor & Francis. pp. 140–152. doi:10.4324/9780203444153_chapter_11. ISBN 978-0-203-27565-8.
- ^ a b Al-Jallad, Ahmad (30 May 2011). "Polygenesis in the Arabic Dialects". Encyclopedia of Arabic Language and Linguistics. Brill Reference. Archived from the original on 15 August 2016. Retrieved 17 July 2016.
- ^ a b c "Examining the origins of Arabic ahead of Arabic Language Day". The National. 15 December 2016. Archived from the original on 20 April 2021. Retrieved 20 April 2021.
- ^ Stern, Josef; Robinson, James T.; Shemesh, Yonatan (15 August 2019). Maimonides' "Guide of the Perplexed" in Translation: A History from the Thirteenth Century to the Twentieth. University of Chicago Press. ISBN 978-0-226-45763-5.
- ^ Bernards, Monique, "Ibn Jinnī", in: Encyclopaedia of Islam, THREE, Edited by: Kate Fleet, Gudrun Krämer, Denis Matringe, John Nawas, Everett Rowson. Consulted online on 27 May 2021 First published online: 2021 First print edition: 9789004435964, 20210701, 2021–4
- ^ Baalbaki, Ramzi (28 May 2014). The Arabic Lexicographical Tradition: From the 2nd/8th to the 12th/18th Century. BRILL. ISBN 978-90-04-27401-3.
- ^ Versteegh 2014, p. 299.
- ^ Retsö, Jan (1989). Diathesis in the Semitic Languages: A Comparative Morphological Study. Brill. ISBN 978-90-04-08818-4. Archived from the original on 4 October 2018. Retrieved 16 May 2017.
- ^ Ibn Khaldūn (1967) [work in the original language written in 1377]. Dawood, N. J. (ed.). The Muqaddimah : An Introduction to History. Translated by Rosenthal, Franz. Princeton University Press (published 27 April 2015). ISBN 978-0-691-16628-5. OCLC 913459792.
- ^ a b Gelvin, James L. (2020). The Modern Middle East: A History (5th ed.). New York: Oxford University Press (published 28 February 2020). p. 112. ISBN 978-0-19-007406-7. OCLC 1122689432.
- ^ Okerson, Ann (2009). "Early Arabic Printing: Movable Type & Lithography". Yale University Library. Archived from the original on 18 February 2020. Retrieved 20 February 2020.
- ^ Hamzaoui, Rached (1975). L'Academie de Langue Arabe du Caire (in French). Publications de l'Université de Tunis. OCLC 462880236.
- ^ الشيال, جمال الدين. رفاعة الطهطاوي : زعيم النهضة الفكرية في عصر محمد علي. OCLC 1041872985.
- ^ Sawaie, Mohammed (30 May 2011). "Language Academies". Encyclopedia of Arabic Language and Linguistics. Archived from the original on 27 February 2021. Retrieved 20 February 2020.
- ^ a b c UNESCO (31 December 2019). بناء مجتمعات المعرفة في المنطقة العربية (in Arabic). UNESCO Publishing. ISBN 978-92-3-600090-9. Archived from the original on 5 April 2021. Retrieved 31 March 2021.
- ^ Tilmatine, Mohand (2015). "Arabization and linguistic domination: Berber and Arabic in the North of Africa". Language Empires in Comparative Perspective. Berlin, München, Boston: DE GRUYTER. pp. 1–16. doi:10.1515/9783110408362.1. ISBN 978-3-11-040836-2. S2CID 132791029.
- ^ Seri-Hersch, Iris (2 December 2020). "Arabization and Islamization in the Making of the Sudanese "Postcolonial" State (1946-1964)". Cahiers d'études africaines (240): 779–804. doi:10.4000/etudesafricaines.32202. ISSN 0008-0055. S2CID 229407091.
- ^ a b Kamusella, Tomasz Dominik (2017). "The Arabic Language: A Latin of Modernity?". Journal of Nationalism, Memory and Language Politics. 11 (2). De Gruyter: 117. doi:10.1515/jnmlp-2017-0006. hdl:10023/12443. ISSN 2570-5857.
- ^ a b Abdulkafi Albirini. 2016. Modern Arabic Sociolinguistics (pp. 34–35).
- ^ Kaye (1991:?)
- ^ "Arabic Language." Microsoft Encarta Online Encyclopedia 2009.
- ^ Trentman, E. and Shiri, S., 2020. The Mutual Intelligibility of Arabic Dialects. Critical Multilingualism Studies, 8(1), pp.104–134.
- ^ "linteau de porte". Musée du Louvre. 328. Archived from the original on 20 April 2021. Retrieved 20 April 2021.
- ^ Jenkins, Orville Boyd (18 March 2000), "Population Analysis of the Arabic Languages", Strategy Leader Resource Kit, archived from the original on 18 March 2009, retrieved 12 March 2009
- ^ "Morocco 2011 Constitution". Constitute. Retrieved 25 September 2022.
- ^ "Journal officiel de la republique du mali secretariat general du gouvernement – decret n°2023-0401/pt-rm du 22 juillet 2023 portant promulgation de la constitution" (PDF). sgg-mali.ml. 22 July 2023. Retrieved 26 July 2023.
Article 31 : Les langues nationales sont les langues officielles du Mali.
- ^ "Morocco 2011 Constitution, Article 5". www.constituteproject.org. Retrieved 18 July 2021.
- ^ Čéplö, Slavomír (1 January 2020). "Chapter 13 Maltese". Arabic and Contact-induced Change.
- ^ Hadjioannou, Xenia; Tsiplakou, Stavroula; Kappler, Matthias (2011). "Language policy and language planning in Cyprus". Current Issues in Language Planning. 12 (4). Routledge: 508. doi:10.1080/14664208.2011.629113. hdl:10278/29371. S2CID 143966308.
- ^ Arabic Language and Linguistics. Georgetown University Press. 2012. ISBN 9781589018853. JSTOR j.ctt2tt3zh.
- ^ Janet C.E. Watson, The Phonology and Morphology of Arabic Archived 14 April 2016 at the Wayback Machine, Introduction, p. xix. Oxford: Oxford University Press, 2007. ISBN 978-0-19-160775-2
- ^ Proceedings and Debates of the Archived 14 April 2016 at the Wayback Machine 107th United States Congress Congressional Record, p. 10,462. Washington, DC: United States Government Printing Office, 2002.
- ^ Shalom Staub, Yemenis in New York City: The Folklore of Ethnicity Archived 14 April 2016 at the Wayback Machine, p. 124. Philadelphia: Balch Institute for Ethnic Studies, 1989. ISBN 978-0-944190-05-0
- ^ Daniel Newman, Arabic-English Thematic Lexicon Archived 13 April 2016 at the Wayback Machine, p. 1. London: Routledge, 2007. ISBN 978-1-134-10392-8
- ^ Rebecca L. Torstrick and Elizabeth Faier, Culture and Customs of the Arab Gulf States Archived 14 April 2016 at the Wayback Machine, p. 41. Santa Barbara: ABC-CLIO, 2009. ISBN 978-0-313-33659-1
- ^ Walter J. Ong, Interfaces of the Word: Studies in the Evolution of Consciousness and Culture Archived 14 April 2016 at the Wayback Machine, p. 32. Ithaca, NY: Cornell University Press, 2012. ISBN 978-0-8014-6630-4
- ^ Clive Holes, Modern Arabic: Structures, Functions, and Varieties Archived 2 November 2022 at the Wayback Machine, p. 3. Washington, DC: Georgetown University Press, 2004. ISBN 978-1-58901-022-2
- ^ Nizar Y. Habash,Introduction to Arabic Natural Language Processing Archived 2 November 2022 at the Wayback Machine, pp. 1–2. San Rafael, CA: Morgan & Claypool, 2010. ISBN 978-1-59829-795-9
- ^ Bernard Bate, Tamil Oratory and the Dravidian Aesthetic: Democratic Practice in South India Archived 2 November 2022 at the Wayback Machine, pp. 14–15. New York: Columbia University Press, 2013. ISBN 978-0-231-51940-3
- ^ Versteegh 2014, p. 107.
- ^ Suleiman, p. 93 Archived 14 April 2016 at the Wayback Machine
- ^ M. Ed., Loyola University-Maryland; B. S., Child Development. "The Importance of the Arabic Language in Islam". Learn Religions. Archived from the original on 1 February 2009. Retrieved 7 January 2021.
- ^ Quesada, Thomas C. Arabic Keyboard (Atlanta ed.). Madisonville: Peter Jones. p. 49. Archived from the original on 27 September 2007. Retrieved 11 October 2012.
- ^ "Reviews of Language Courses". Lang1234. Retrieved 12 September 2012.
- ^ a b c d e f "Lexicography, Arabic". Encyclopaedia of Islam, THREE. Brill. 2020. doi:10.1163/1573-3912_ei3_com_35848.
- ^ Richards, D. S. (1999). "Edward Lane's Surviving Arabic Correspondence". Journal of the Royal Asiatic Society. 9 (1): 1–25. doi:10.1017/S135618630001590X. ISSN 1356-1863. JSTOR 25183625. S2CID 161420127.
- ^ "Lane, Edward William". Encyclopaedia of Islam. Brill. 2020. doi:10.1163/1573-3912_ei3_com_35793.
- ^ "Hedwig Klein and "Mein Kampf": The unknown Arabist - Qantara.de". Qantara.de – Dialogue with the Islamic World. 7 April 2018. Retrieved 15 June 2023.
- ^ Abu-Haidar, J. A. (1983). "Review of A Dictionary of Modern Written Arabic (Arabic-English)". Bulletin of the School of Oriental and African Studies, University of London. 46 (2): 351–353. doi:10.1017/S0041977X00079040. ISSN 0041-977X. JSTOR 615409. S2CID 162954225.
- ^ "المعجم التاريخي للعربية.. ضوء في عتمة الهوان". Hespress – هسبريس جريدة إلكترونية مغربية (in Arabic). 15 November 2020. Retrieved 31 March 2021.
- ^ von Grunebaum, G. E. (1959). "Review of Al-Muʿjam al-kabīr, Murad Kāmil, Ibrāhīm al-Ibyārī". Journal of Near Eastern Studies. 18 (2): 157–159. doi:10.1086/371525. ISSN 0022-2968. JSTOR 543279.
- ^ الجبر, خالد. "معجم الدوحة التاريخي للغة العربية.. الواقع الحقيقي للغة والحضارة". www.aljazeera.net (in Arabic). Retrieved 17 June 2023.
- ^ See the seminal study by Siegmund Fraenkel, Die aramäischen Fremdwörter im Arabischen, Leiden 1886 (repr. 1962)
- ^ See for instance Wilhelm Eilers, "Iranisches Lehngut im Arabischen", Actas IV. Congresso des Estudos Árabes et Islâmicos, Coimbra, Lisboa, Leiden 1971, with earlier references.
- ^ a b Lucas C, Manfredi S (2020). Lucas C, Manfredi S (eds.). Arabic and contact-induced change (pdf). Berlin: Language Science Press. doi:10.5281/zenodo.3744565. ISBN 978-3-96110-252-5. Archived from the original on 16 January 2021. Retrieved 7 January 2021.
- ^ PhD, D. Gershon Lewental. "Rasmī or aslī?: Arabic's impact on modern Israeli Hebrew by D Gershon Lewental, PhD (DGLnotes)". DGLnotes. Retrieved 27 November 2021.
- ^ "Top 50 English Words – of Arabic Origin". blogs.transparent.com. Arabic Language Blog. 21 February 2012. Archived from the original on 15 December 2018. Retrieved 14 December 2018.
- ^ EB staff. "Maltese language – Britannica Online Encyclopedia". Britannica.com. Archived from the original on 5 June 2008. Retrieved 4 May 2010.
- ^ Gregersen (1977:237)
- ^ Ferguson, Charles (1959), "The Arabic Koine", Language, 35 (4): 616–630, doi:10.2307/410601, JSTOR 410601
- ^ Arabic, Egyptian Spoken at Ethnologue (25th ed., 2022)
- ^ Levantine Arabic at Ethnologue (25th ed., 2022)
- ^ Arabic, Cypriot Spoken at Ethnologue (25th ed., 2022)
- ^ a b Borg, Albert J.; Azzopardi-Alexander, Marie (1997). Maltese. Routledge. ISBN 0-415-02243-6.
- ^ Borg and Azzopardi-Alexander (1997). Maltese. Routledge. p. xiii. ISBN 978-0-415-02243-9.
In fact, Maltese displays some areal traits typical of Maghrebine Arabic, although over the past 800 years of independent evolution it has drifted apart from Tunisian Arabic
- ^ Brincat, Joseph M. (February 2005). Maltese – an unusual formula. MED Magazine. Archived from the original on 8 December 2015. Retrieved 17 February 2018.
Originally Maltese was an Arabic dialect but it was immediately exposed to Latinisation because the Normans conquered the islands in 1090, while Christianisation, which was complete by 1250, cut off the dialect from contact with Classical Arabic. Consequently Maltese developed on its own, slowly but steadily absorbing new words from Sicilian and Italian according to the needs of the developing community.
- ^ Robert D Hoberman (2007). Morphologies of Asia and Africa, Alan S. Kaye (Ed.), Chapter 13: Maltese Morphology. Eisenbrown. ISBN 978-1-57506-109-2. Archived from the original on 4 October 2018.
Maltese is the chief exception: Classical or Standard Arabic is irrelevant in the Maltese linguistic community and there is no diglossia.
- ^ Robert D Hoberman (2007). Morphologies of Asia and Africa, Alan S. Kaye (Ed.), Chapter 13: Maltese Morphology. Eisenbrown. ISBN 978-1-57506-109-2. Archived from the original on 4 October 2018.
yet it is in its morphology that Maltese also shows the most elaborate and deeply embedded influence from the Romance languages, Sicilian and Italian, with which it has long been in intimate contact....As a result Maltese is unique and different from Arabic and other Semitic languages.
- ^ "Mutual Intelligibility of Spoken Maltese, Libyan Arabic and Tunisian Arabic Functionally Tested: A Pilot Study". p. 1. Archived from the original on 11 October 2017. Retrieved 23 September 2017.
To summarise our findings, we might observe that when it comes to the most basic everyday language, as reflected in our data sets, speakers of Maltese are able to understand less than a third of what is being said to them in either Tunisian or Benghazi Libyan Arabic.
- ^ "Mutual Intelligibility of Spoken Maltese, Libyan Arabic and Tunisian Arabic Functionally Tested: A Pilot Study". p. 1. Archived from the original on 11 October 2017. Retrieved 23 September 2017.
Speakers of Tunisian and Libyan Arabic are able to understand about 40% of what is said to them in Maltese.
- ^ "Mutual Intelligibility of Spoken Maltese, Libyan Arabic and Tunisian Arabic Functionally Tested: A Pilot Study". p. 1. Archived from the original on 11 October 2017. Retrieved 23 September 2017.
In comparison, speakers of Libyan Arabic and speakers of Tunisian Arabic understand about two-thirds of what is being said to them.
- ^ Isserlin (1986). Studies in Islamic History and Civilization, ISBN 965-264-014-X
- ^ Arabic, North Mesopotamian Spoken at Ethnologue (25th ed., 2022)
- ^ Müller-Kessler, Christa (2003). "Aramaic ?k?, lyk? and Iraqi Arabic ?aku, maku: The Mesopotamian Particles of Existence". Journal of the American Oriental Society. 123 (3): 641–646. doi:10.2307/3217756. ISSN 0003-0279. JSTOR 3217756.
- ^ Evi Nurus Suroiyah; Dewi Anisatuz Zakiyah (7 June 2021). "Perkembangan Bahasa Arab di Indonesia" [Development of Arabic in Indonesia]. Muhadasah: Jurnal Pendidikan Bahasa Arab (in Indonesian). 3 (1): 60–69. doi:10.51339/muhad.v3i1.302. ISSN 2721-9488.
- ^ Arabic, Tajiki Spoken at Ethnologue (25th ed., 2022)
- ^ "Central Asian Arabic: The Irano-Arabic Dynamics of a New Perfect", Linguistic Convergence and Areal Diffusion, Routledge, 2004, pp. 121–134, doi:10.4324/9780203327715-12, ISBN 9780203327715, retrieved 14 January 2023
- ^ "Tajiki Spoken Arabic", Endangered Languages, retrieved 14 January 2023
- ^ Kirchhoff & Vergyri (2005:38)
- ^ Kirchhoff & Vergyri (2005:38–39)
- ^ Holes (2004:57)
- ^ Rydin, Karin C. (2005). A reference grammar of Modern Standard Arabic. New York: Cambridge University Press.
- ^ Tabbaa, Yasser (1991). "The Transformation of Arabic Writing: Part I, Qur'ānic Calligraphy". Ars Orientalis. 21: 119–148. ISSN 0571-1371. JSTOR 4629416.
- ^ Hanna & Greis (1972:2)
- ^ Ibn Warraq (2002). Ibn Warraq (ed.). What the Koran Really Says : Language, Text & Commentary. Translated by Ibn Warraq. New York: Prometheus. p. 64. ISBN 157392945X. Archived from the original on 11 April 2019.
- ^ "Hassan Massoudy". Barjeel Art Foundation. Retrieved 1 June 2024.
- ^ Osborn, J.R. (2009). "Narratives of Arabic Script: Calligraphic Design and Modern Spaces". Design and Culture. 1 (3): 289–306. doi:10.1080/17547075.2009.11643292. S2CID 147422407.
- ^ a b c d e f g Shrivtiel, Shraybom (1998). The Question of Romanisation of the Script and The Emergence of Nationalism in the Middle East. Mediterranean Language Review. pp. 179–196.
- ^ a b c d Shrivtiel, p. 188
- ^ a b c Shrivtiel, p. 189
- ^ "OHCHR | Universal Declaration of Human Rights - Arabic (Alarabia)".
- ^ "Universal Declaration of Human Rights". United Nations.
Sources
[edit]- Al-Jallad, Ahmad (2020a). A Manual of the Historical Grammar of Arabic. doi:10.1017/S0041977X14000524. Archived from the original on 21 December 2019. Retrieved 16 July 2021 – via Academia.
- As-Sabil, archived from the original on 25 April 2016, retrieved 22 June 2016
- Bateson, Mary Catherine (2003), Arabic Language Handbook, Georgetown University Press, ISBN 978-0-87840-386-8
- Birnstiel, Daniel (2019). "Chapter 15: Classical Arabic". In Huehnergard, John; Pat-El, Na'ama (eds.). The Semitic Languages (2nd ed.). Routledge. pp. 367–402. doi:10.4324/9780429025563. ISBN 978-0-415-73195-9. OCLC 1103311755. S2CID 166512720.
- Durand, Olivier; Langone, Angela D.; Mion, Giuliano (2010), Corso di Arabo Contemporaneo. Lingua Standard (in Italian), Milan: Hoepli, ISBN 978-88-203-4552-5
- Gregersen, Edgar A. (1977), Language in Africa, CRC Press, ISBN 978-0-677-04380-7
- Grigore, George (2007), L'arabe parlé à Mardin. Monographie d'un parler arabe périphérique, Bucharest: Editura Universitatii din Bucuresti, ISBN 978-973-737-249-9, archived from the original on 27 September 2007
- Hanna, Sami A.; Greis, Naguib (1972), Writing Arabic: A Linguistic Approach, from Sounds to Script, Brill Archive, ISBN 978-90-04-03589-8
- Haywood; Nahmad (1965), A new Arabic grammar, London: Lund Humphries, ISBN 978-0-85331-585-8
- Hetzron, Robert (1997), The Semitic languages (Illustrated ed.), Taylor & Francis, ISBN 978-0-415-05767-7
- Holes, Clive (2004). Modern Arabic: Structures, Functions, and Varieties. Georgetown University Press. ISBN 978-1-58901-022-2.
- Irwin, Robert (2006), For Lust of Knowing, London: Allen Lane
- Kaplan, Robert B.; Baldauf, Richard B. (2007), Language Planning and Policy in Africa, Multilingual Matters, ISBN 978-1-85359-726-8
- Kaye, Alan S. (1991), "The Hamzat al-Waṣl in Contemporary Modern Standard Arabic", Journal of the American Oriental Society, 111 (3): 572–574, doi:10.2307/604273, JSTOR 604273
- Kirchhoff, Katrin; Vergyri, Dimitra (2005). "Cross-dialectal data sharing for acoustic modeling in Arabic speech recognition". Speech Communication. 46 (1): 37–51. doi:10.1016/j.specom.2005.01.004.
- Lane, Edward William (1893), Arabic–English Lexicon (2003 reprint ed.), New Delhi: Asian Educational Services, ISBN 978-81-206-0107-9, archived from the original on 10 December 2013
{{citation}}: ISBN / Date incompatibility (help) - Lipinski, Edward (1997), Semitic Languages, Leuven: Peeters
- Mion, Giuliano (2007), La Lingua Araba (in Italian), Rome: Carocci, ISBN 978-88-430-4394-1
- Mumisa, Michael (2003), Introducing Arabic, Goodword Books, ISBN 978-81-7898-211-3
- Procházka, S. (2006), ""Arabic"", Encyclopedia of Language and Linguistics (2nd ed.)
- Ryding, Karin Christina; Wilmsen, David, eds. (2021). The Cambridge Handbook of Arabic Linguistics. New York: Cambridge. ISBN 978-1-108-41730-3.
- Steingass, Francis Joseph (1993), Arabic–English Dictionary, Asian Educational Services, ISBN 978-81-206-0855-9, archived from the original on 3 April 2013, retrieved 21 September 2020
- Suileman, Yasir. Arabic, Self and Identity: A Study in Conflict and Displacement. Oxford University Press, 2011. ISBN 0-19-974701-6, 978-0-19-974701-6.
- Thelwall, Robin (2003). "Arabic". Handbook of the International Phonetic Association a guide to the use of the international phonetic alphabet. Cambridge: Cambridge University Press. ISBN 978-0-521-63751-0.
- Traini, R. (1961), Vocabolario di arabo [Dictionary of Modern Written Arabic] (in Italian), Rome: I.P.O. – via Harassowitz
- Vaglieri, Laura Veccia, Grammatica teorico-pratica della lingua araba, Rome: I.P.O.
- Versteegh, C. H. M. (2014). The Arabic Language. Edinburgh University Press. ISBN 978-0-7486-4528-2. OCLC 872980196.
- Watson, Janet (2002), The Phonology and Morphology of Arabic, New York: Oxford University Press, ISBN 978-0-19-824137-9
- Wehr, Hans (1952), Arabisches Wörterbuch für die Schriftsprache der Gegenwart: Arabisch-Deutsch (1985 reprint (English) ed.), Harassowitz, ISBN 978-3-447-01998-9
{{citation}}: ISBN / Date incompatibility (help) - Wright, John W. (2001), The New York Times Almanac 2002, Routledge, ISBN 978-1-57958-348-4
Further reading
[edit]- Al Malwi, Ibrahim; Herrero De Haro, Alfredo; Baker, Amanda (2023). "Abha Arabic". Illustrations of the IPA. Journal of the International Phonetic Association: 1–19. doi:10.1017/S0025100323000269, with supplementary sound recordings.
External links
[edit]- Standard Arabic version of Wikipedia
- Egyptian Arabic version of Wikipedia
- Moroccan Arabic version of Wikipedia
- Algerian Arabic test version of Wikipedia at Wikimedia Incubator
- Hassaniya Arabic test version of Wikipedia at Wikimedia Incubator
- Levantine Arabic test version of Wikipedia at Wikimedia Incubator
- Tunisian Arabic test version of Wikipedia at Wikimedia Incubator
Arabic
View on GrokipediaLinguistic Classification
Semitic Roots and Family Relations
Arabic belongs to the Semitic branch of the Afro-Asiatic language family, a phylum encompassing languages spoken across North Africa, the Horn of Africa, and the Middle East.[8] The Semitic languages share a common ancestor in Proto-Semitic, reconstructed as having been spoken approximately 5,750 to 6,350 years ago based on comparative linguistic evidence from attested forms like Akkadian and Eblaite.[9] Key Proto-Semitic features preserved in Arabic include a triconsonantal root system for deriving words, case endings in nouns (nominative, accusative, genitive), and a rich morphology with patterns like the imperfective verb stem prefixed by ya-.[8] Within the Semitic family, traditional classifications divide languages into East Semitic (e.g., Akkadian, extinct by around 100 CE) and West Semitic, with the latter further splitting into Central and South branches.[10] Arabic is positioned in the Central Semitic subgroup, which also includes Northwest Semitic languages such as Aramaic (with dialects persisting into the present day in communities like Assyrian and Mandean speakers), Hebrew (revived as Modern Hebrew since the late 19th century), Ugaritic (extinct by 1200 BCE), and Canaanite languages like Phoenician.[11] This grouping reflects shared innovations, such as the merger of Proto-Semitic ś and š into a single sibilant and the development of the "yaqtulu" perfective verb form.[10] Arabic's relations to other Semitic languages are evident in extensive cognates and structural parallels. For instance, the Arabic root s-l-m ("peace, submission") corresponds to Hebrew š-l-m (shalom, "peace") and Aramaic š-l-m-a ("peace"), tracing back to Proto-Semitic *šalām-. Similarly, basic vocabulary like "hand" (*yad- in Proto-Semitic, yad in Arabic and Hebrew) and "water" (*may- > mā' in Arabic) demonstrates deep lexical continuity.[8] Compared to South Semitic languages (e.g., Ge'ez in Ethiopia, with about 10 million speakers today), Arabic shows closer affinity to Northwest Semitic in verbal morphology, though some scholars debate whether Arabic forms a distinct "South Central" node or aligns more with ancient South Arabian languages like Sabaic, based on epigraphic evidence from Yemen dating to the 1st millennium BCE.[12] Arabic's phonology is notably conservative, retaining 28 of Proto-Semitic's approximately 29 consonants, including emphatics like ṣ and ḍ, which have shifted or merged in languages like Hebrew and Aramaic.[8] This preservation has made Classical Arabic a key resource for reconstructing Proto-Semitic, as noted in comparative studies emphasizing its unbroken attestation from the 4th century CE onward.[12] However, classifications remain contested; while most linguists affirm Central Semitic unity through shared isoglosses like the "aCCaC" noun pattern, alternative proposals suggest Arabic's independent evolution from a pre-Proto-Arabic stage around 1000 BCE, influenced by contact with neighboring dialects.Proto-Arabic and Early Forms
Proto-Arabic denotes the reconstructed proto-language ancestral to all later Arabic varieties, derived via comparative historical linguistics from attested Old Arabic inscriptions and contemporary dialects.[13] This reconstruction identifies shared innovations distinguishing Arabic from other Central Semitic languages, such as the merger of Proto-Semitic *ś and *s into s, and the development of the emphatic lateral *ḍ into ḍ.[8] Linguistic evidence places Proto-Arabic speakers among nomadic pastoralists in the northern Arabian Peninsula and Syro-Arabian desert fringes during the late 2nd millennium BCE to early 1st millennium CE, prior to the emergence of distinct Old Arabic dialects.[14] Early attested forms of Arabic, classified as Old Arabic, appear in epigraphic records from the 1st century BCE onward, primarily in the Ancient North Arabian scripts adapted for Arabic speech.[15] Safaitic, the most voluminous corpus, consists of over 30,000 graffiti inscriptions carved across the basaltic deserts of southern Syria, Jordan, and northern Saudi Arabia, dating from approximately the late 1st millennium BCE to the 4th century CE.[15] These texts document nomadic herders' daily life, invoking deities like Allāt and recording tribal affiliations, while exhibiting phonological and morphological traits transitional to Classical Arabic, including the anaphoric article ʔl- and broken plurals.[16] Hismaic inscriptions, a closely related variety, occur in southern Jordan's Hisma region, with fewer than 100 examples dated to the 1st-2nd centuries CE, sharing Safaitic script and linguistic features like the relative pronoun ḏū and sound shifts aligning with later Arabic.[15] Other early epigraphs, such as the Namara inscription of 328 CE near Damascus, represent the first unambiguously dated Arabic text in a derivative Aramaic script, commemorating the Lakhmid king Imruʾ al-Qays and displaying verbal syntax and vocabulary proximate to Quranic Arabic.[17] These pre-Islamic attestations, totaling thousands of short texts, reveal dialectal diversity among Bedouin groups but confirm a coalescing linguistic continuum by the 4th century CE, setting the stage for the standardization of Classical Arabic.[16]Historical Evolution
Pre-Islamic and Old Arabic
Old Arabic designates the varieties of the Arabic language attested prior to the Islamic era, spanning from the early first millennium BCE to the sixth century CE across the Arabian Peninsula and adjacent territories. Epigraphic records first emerge around the beginning of this period, primarily through short inscriptions in diverse scripts, reflecting interactions with neighboring Semitic languages such as Aramaic and South Arabian. These attestations indicate a dialect continuum rather than a unified standard, with linguistic features like the definite article *al- and certain verbal conjugations foreshadowing Classical Arabic.[15] The bulk of pre-Islamic evidence derives from nomadic graffiti in the Ancient North Arabian script family, particularly Safaitic, which comprises over 30,000 inscriptions dating from the first century BCE to the fourth century CE in the Syrian Desert, northern Jordan, and southern Syria. Safaitic texts, often carved on rocks by pastoralists, document daily concerns such as herding, raids, and invocations to deities, revealing phonetic shifts (e.g., g to j in some forms) and morphological traits distinct from but ancestral to later Arabic.[18] Similar corpora include Hismaic from southern Jordan and Thamudic variants from central and northern Arabia, both classified under Old Arabic due to shared innovations like the ʾallā negative particle.[19] Nabataean-script inscriptions provide additional northern evidence, blending Aramaic orthography with Arabic grammar; the Namara epitaph of 328 CE, honoring the Kindite king Imru' al-Qays, offers the earliest extended prose in Arabic, comprising seven lines praising his conquests from Mesopotamia to Yemen.[17] Earlier fragments, such as a possible pre-150 CE Nabataean-Arabic text, confirm the language's presence in trade hubs like Petra. Southern pre-Islamic Arabic appears sparser, influenced by Sabaic and Minaic, with transitional forms in Dadanitic and Taymanitic inscriptions from the northwest, dating to the sixth century BCE onward.[15] Pre-Islamic Arabic remained predominantly oral, with written use confined to epigraphy among traders and nomads, lacking extended literary works until post-Islamic codification. Archaeological finds, including a 470 CE inscription from Saudi Arabia in a Christian milieu, underscore Arabic's pre-Islamic vitality in diverse religious contexts, predating the Quran by over a century.[20] This epigraphic corpus, deciphered through comparative Semitics, reveals causal linguistic evolution driven by migration, trade, and substrate influences, rather than isolated development.[21]Emergence of Classical Arabic via Quran (7th Century)
Prior to the 7th century, Arabic manifested in diverse tribal dialects across the Arabian Peninsula, with limited written attestation in forms such as the Safaitic and Hismaic inscriptions, but lacking a unified literary standard.[8] These pre-Islamic varieties, often termed Old Arabic, included poetic traditions preserved orally in dialects like that of the Quraysh tribe in Mecca, yet they exhibited phonological and morphological variations that hindered cross-tribal comprehension.[22] The absence of a codified grammar or orthography meant that written records, such as the Namara inscription dated to 328 CE, represented localized epigraphic uses rather than a standardized language.[8] The revelation of the Quran to Muhammad between approximately 610 and 632 CE marked the pivotal consolidation of what became Classical Arabic, primarily drawing from the Quraysh dialect deemed purest by contemporaries.[23] This text, comprising 114 surahs in a rhythmic, rhymed prose (saj'), elevated specific linguistic features—including a rich case system (i'rab), complex root-based morphology, and precise syntax—into a fixed exemplar that transcended oral variability.[24] The Quran's composition in this dialect, coupled with its recitation in prayer and memorization, imposed a normative influence, as tribal Arabs recognized its unparalleled eloquence, prompting emulation in emerging written works.[25] By the mid-7th century, following Muhammad's death in 632 CE, the Quran's compilation under Caliph Uthman (r. 644–656 CE) into a standardized mushaf further entrenched this form, eliminating variant recitations and establishing orthographic conventions using the nascent Kufic script derived from Nabataean.[8] Early manuscripts, such as the Birmingham folios radiocarbon-dated to 568–645 CE, attest to the rapid dissemination and fidelity of this textual archetype, which served as the linguistic benchmark amid the initial Islamic expansions.[22] This process did not invent Arabic anew but crystallized an existing prestigious dialect into Classical Arabic, the vehicle for religious, legal, and literary expression, with subsequent grammarians referencing Quranic usage as authoritative.[26] The causal mechanism—divine revelation in human language, followed by institutional canonization—ensured its preservation against dialectal drift, distinguishing it from purely evolutionary linguistic shifts.Spread Through Islamic Conquests (7th-8th Centuries)
The Islamic conquests initiated after the death of Muhammad in 632 CE under the Rashidun Caliphs rapidly expanded Arab Muslim dominion from the Arabian Peninsula across the Middle East, North Africa, and into Persia and beyond, creating conditions for Arabic's initial dissemination as a language of governance and religion. By 651 CE, the Sassanid Empire had fallen, with key victories such as the Battle of Yarmouk in 636 CE securing the Levant and the conquest of Egypt completed by 642 CE, establishing administrative centers like Fustat where Arab garrisons promoted the use of Arabic among settlers and officials.[27] These military successes, often involving negotiated surrenders that preserved local religious practices under jizya taxation, introduced Arabic through Quranic recitation, military commands, and early administrative records, though vernacular languages like Aramaic, Coptic, and Pahlavi persisted among conquered populations.[28] Under the Umayyad Caliphate (661–750 CE), Arabic's role intensified as Caliph Abd al-Malik (r. 685–705 CE) enacted reforms around 686 CE, mandating Arabic as the exclusive language for administration, coinage, and diplomacy across the empire stretching from Iberia to Central Asia. This policy replaced Greek, Persian, and other scripts in official papyri and diwans (bureaucratic offices), fostering an Arabized administrative elite and standardizing communication in provinces like Syria and Iraq, where Arab tribal settlements in amsar (garrison cities) such as Basra (founded 636 CE) and Kufa accelerated linguistic contact.[28] While mass conversions to Islam increased exposure to the Quran—recited solely in Arabic—full linguistic replacement was limited in the 8th century, confined largely to urban and military spheres, with rural and non-Muslim communities retaining indigenous tongues; for instance, Berber languages endured in the Maghreb despite conquests reaching modern Tunisia by 698 CE.[29] The linkage between Arabic's prestige as the language of the Quran and imperial utility drove its adoption, yet empirical evidence from surviving documents indicates uneven penetration: Greek and Coptic documents coexisted in Egypt into the 8th century, and Persian influences lingered in the east, underscoring that conquests provided the vector but social incentives like tax exemptions for converts and intermarriage propelled gradual vernacularization over subsequent eras.[30] This period marked Arabic's transition from a tribal dialect to an imperial lingua franca, laying groundwork for its enduring dominance in literate Muslim spheres without immediate erasure of substrate languages.Medieval Golden Age Contributions (8th-13th Centuries)
The Abbasid era's translation movement, centered in Baghdad from the late 8th century onward, systematically converted Greek, Syriac, Persian, and Indian texts into Arabic, enriching the language with thousands of neologisms and technical terms derived from or calqued upon foreign roots, such as al-jabr for algebra and kimiya for chemistry.[31] This effort, peaking under Caliph al-Ma'mun (r. 813–833), involved over 100 translators and produced Arabic versions of Aristotle's works, Euclid's Elements (translated by al-Hajjaj ibn Yusuf ibn Matar around 830), and Ptolemy's Almagest, establishing Arabic as the primary medium for interdisciplinary synthesis.[31] By the 10th century, original compositions in Arabic surpassed translations, with scholars composing treatises that integrated and extended prior knowledge, as seen in al-Khwarizmi's Kitab al-Jabr wa al-Muqabala (c. 820), which formalized algebraic methods using Arabic script for equations.[32] Linguistic standardization advanced through Sibawayh's Al-Kitab (completed c. 790), a 500,000-word compendium analyzing Quranic and poetic Arabic via 5,000+ examples, classifying roots, case endings, and verb patterns (i'rab) through empirical observation of Bedouin dialects rather than prescriptive rules.[33] This Basra school text, influencing later grammarians like al-Farra' (d. 822), preserved Classical Arabic (fusha) as a rigorous analytical tool, enabling precise expression in philosophy and science; for instance, it formalized ishtiqaq (derivation) to generate terms like falsafa from Greek influences.[34] By the 9th century, Arabic rhetoric (balagha) evolved to accommodate complex argumentation, as in al-Jahiz's Kitab al-Bayan wa al-Tabyin (c. 860), which dissected stylistic devices for persuasive prose.[31] In medicine, Arabic texts codified empirical methods: al-Razi's Kitab al-Hawi (c. 900–920), spanning 23 volumes, compiled 528 authors' observations into a reference work using Arabic indices and clinical trials, distinguishing measles from smallpox via symptoms.[31] Ibn Sina's Al-Qanun fi al-Tibb (1025), in five books, systematized pharmacology with 760 drugs tested against Galenic theory, employing Arabic logical terms like qiyas (deduction) for diagnostics.[31] Astronomy benefited from Arabic innovations, such as al-Battani's Kitab al-Zij (c. 900), refining Ptolemaic models with 489 observations yielding trigonometric tables accurate to 0.1 degrees for solar year length (365 days, 5 hours, 46 minutes).[32] Ibn al-Haytham's Kitab al-Manazir (1021) pioneered experimental optics, refuting emission theory through camera obscura tests described in Arabic geometric proofs.[32] These contributions, peaking before the Mongol sack of Baghdad in 1258, expanded Arabic's morphological capacity—adding prefixes like ta- for reflexivity and suffixes for abstraction—while fostering a diglossic divide, as vernaculars ('ammiyya) emerged in urban centers like Andalusia, yet Classical Arabic retained dominance in 80% of preserved manuscripts from Cordoba to Samarkand. Original Arabic output, including al-Farabi's logical treatises (c. 940) adapting syllogisms via burhan (demonstration), underscored causal reasoning over rote transmission, though later scholasticism (kalam) sometimes prioritized theology.[34] This era's 400+ surviving scientific works in Arabic laid groundwork for European Renaissance via Latin translations, without which fields like algebra would lack systematic notation.[32]Post-Golden Age Stagnation and Ottoman Influence (14th-19th Centuries)
The sack of Baghdad by Mongol forces in 1258 CE destroyed key centers of Arabic scholarship, including libraries housing vast collections of scientific and philosophical texts, effectively ending the Abbasid caliphate and contributing to a broader decline in original intellectual production.[35] This event symbolized the transition from the dynamic synthesis of Greek, Persian, and Indian knowledge in Arabic to a more insular focus on religious exegesis and jurisprudence, as patronage for rational sciences waned under subsequent regimes.[36] By the 13th century, the output of significant Arabic works in mathematics, astronomy, and medicine had sharply decreased, with Europe surpassing the Islamic world in scholarly advancements.[37] Under Mamluk rule (1250–1517 CE), which controlled Egypt, Syria, and the Hejaz, Arabic scholarship persisted in urban centers like Cairo and Damascus, but emphasized commentaries on earlier works rather than novel contributions, reflecting institutional priorities in madrasas that favored fiqh and hadith over philosophy or empirical inquiry.[36] Linguistic studies during this era produced grammatical treatises, such as those building on Sibawayh's 8th-century framework, yet these largely reiterated classical structures without substantive evolution, preserving fus'ha (eloquent Arabic) as a static liturgical and literary medium. Poetry and adab (belles-lettres) continued, with figures like Ibn Khaldun (d. 1406 CE) authoring historiographical works in Arabic that analyzed societal decline, but overall innovation stagnated amid political fragmentation and recurrent plagues.[38] The Ottoman conquest of Arab territories, beginning with Egypt in 1517 CE, integrated much of the Arabic-speaking world into a vast empire where Turkish served as the administrative and military lingua franca, while Arabic retained primacy in religious, legal, and scholarly domains as the language of the Quran and Sharia.[39] Ottoman Turkish, written in a modified Arabic script, incorporated extensive Arabic vocabulary—up to 88% in some registers—but exerted reciprocal influence primarily on colloquial Arabic dialects through loanwords related to governance, military, and daily life, such as Egyptian Arabic dulab (cupboard) from Turkish dolap or Levantine bashma' (pants) from paçama.[40] Classical Arabic grammar and rhetoric saw minimal development, with ulema in places like al-Azhar producing encyclopedic compilations rather than transformative texts, amid a cultural emphasis on orthodoxy that discouraged deviation from medieval precedents.[41] Technological factors compounded linguistic conservatism; the Ottoman Empire adopted the printing press slowly, with the first Muslim-operated press established in Istanbul in 1727 CE by Ibrahim Muteferrika for Turkish texts, while Arabic-script printing for religious works faced resistance from scribes and scholars until the late 18th century, limiting the dissemination of knowledge compared to Europe's post-Gutenberg proliferation.[42] This era thus entrenched diglossia, with fus'ha fossilized for elite and sacred uses while dialects absorbed Ottoman-era Turkisms and diverged further, setting the stage for 19th-century revival efforts amid European encroachment.[43]Nahda Revival and Modern Standardization (19th-20th Centuries)
The Nahda, an intellectual and cultural movement emerging in the early 19th century primarily in Ottoman Syria, Lebanon, and Egypt, revitalized Arabic as a vehicle for modern discourse by drawing on classical heritage while incorporating Western scientific and literary concepts. Triggered by factors including the proliferation of Arabic printing presses—beginning with limited Ottoman approvals in the 18th century and accelerating after 1828 with Egypt's al-Waqa'i' al-Misriyya newspaper—this period addressed Arabic's post-medieval stagnation through lexical expansion and stylistic simplification to handle topics like technology and governance.[44][45][46] Prominent reformers included Butrus al-Bustani (1819–1883), who established the National School in Beirut in 1863 as the first secular Arabic-medium institution for modern subjects and published the encyclopedic dictionary Muḥīṭ al-Muḥīṭ in 1870 to systematize vocabulary and promote linguistic unity amid sectarian divides.[47][48] Ahmad Fāris al-Shidyāq (1805–1887), after travels in Europe and service in Tunis, authored grammatical treatises like al-Jāsūs (1854) and al-Wasīṭah (1886), critiquing ornate medieval styles and advocating root-based neologisms to adapt Arabic for administrative and scientific use.[46][49] These efforts, often linked to Christian intellectuals exposed via missionary presses, fostered periodicals such as al-Bustani's al-Jinān (1870), which standardized fuṣḥā prose for public debate.[44] Transitioning into the 20th century, post-World War I Arab nationalism and independence movements intensified standardization to counter dialectal fragmentation and support unified education. The Arab Academy of Damascus, founded in 1919 under Emir Faisal, prioritized deriving technical terms from classical roots—producing over 1,000 neologisms by the 1930s for disciplines like physics and biology—while rejecting foreign loanwords where possible.[50][51] The Egyptian Language Academy, established in Cairo in 1932, similarly regulated grammar and orthography, influencing curricula across Arab states and media broadcasts.[51][52] Modern Standard Arabic (MSA), evolving from Nahda adaptations of Classical Arabic, emerged as a codified variety by the mid-20th century, retaining fusional morphology and diglossic status but with streamlined syntax for journalism and bureaucracy; for instance, the 1945 Arab League Charter reinforced its role in official communications among 22 member states.[53][54] Despite academy efforts, MSA's implementation varied, with persistent debates over purism versus pragmatism—evident in the 1960s adoption of terms like talfāz for "television" in some regions—reflecting causal tensions between linguistic heritage and technological imperatives.[52][55] This standardization, while enabling pan-Arab media like Radio Cairo's 1930s broadcasts, did not fully supplant dialects in speech, maintaining diglossia.[53]Varieties and Diglossia
Classical Arabic as Liturgical Standard
Classical Arabic functions as the fixed liturgical language of Islam, enshrined in the Quran and the ritual recitations of daily worship. The Quran, revealed to Muhammad between 610 and 632 CE, was composed in this variety of Arabic, which Muslims regard as its purest and most eloquent form, and it mandates recitation in the original tongue for spiritual efficacy.[56][57] This standardization ensures that the core texts and invocations remain unaltered, fostering doctrinal unity among over 1.8 billion adherents worldwide, irrespective of their native dialects.[58] In Islamic prayer (salah), performed five times daily by observant Muslims, key components such as the Fatiha surah and other Quranic verses are recited exclusively in Classical Arabic, with Arabic supplications (du'a) integrated into the rite. This practice, derived from the Prophet's example as recorded in hadith collections, underscores the language's sacral status, where deviation from the prescribed Arabic phrasing invalidates the prayer's validity according to major jurisprudential schools. Tajwid rules, codifying precise pronunciation and intonation, further preserve phonetic fidelity, transmitted orally through chains of authority (isnad) dating to the 7th century.[59][60][61] Beyond obligatory worship, Classical Arabic dominates religious scholarship, exegesis (tafsir), and legal deliberation (fiqh), where texts like hadith compilations by Bukhari (d. 870 CE) and Muslim (d. 875 CE) are analyzed in their original form. Friday congregational prayers include Quranic recitation in Classical Arabic, though sermons (khutba) may incorporate vernacular explanations. This diglossic role reinforces the language's endurance, as generations memorize the entire Quran (hifz), with millions achieving this feat annually, safeguarding against semantic drift.[62][63][64] The liturgical primacy of Classical Arabic also influences non-Arab Muslim communities, compelling study for ritual competence and deeper textual engagement, as translations are deemed interpretive aids rather than equivalents. Historical mechanisms, including Uthman's standardization of the codex around 650 CE and variant readings (qira'at) approved by consensus, have perpetuated its integrity amid evolving spoken forms.[65][66]Modern Standard Arabic (MSA)
Modern Standard Arabic (MSA), known in Arabic as al-fuṣḥā al-ʿarabiyya al-ḥadītha or contemporary fuṣḥā, constitutes the codified literary register of Arabic utilized for formal communication, encompassing official documents, scholarly publications, broadcast media, and educational curricula across the Arab world. It functions as a supradialectal standard, facilitating interoperability among speakers of mutually unintelligible vernaculars in over 20 countries, where Arabic serves as an official language for approximately 420 million individuals as of 2023 estimates. MSA's uniformity stems from its basis in the grammar, morphology, and core lexicon of Classical Arabic, while adapting to contemporary domains through neologisms derived from triconsonantal roots or calibrated loanwords, ensuring semantic precision without rupture from historical precedents.[54][67] The consolidation of MSA occurred primarily during the 19th and 20th centuries amid the Nahḍah (Arab Awakening), a period of cultural and intellectual resurgence triggered by encounters with European modernity and the imperative for administrative reform under Ottoman and colonial administrations. Language academies, such as Egypt's Majmaʿ al-Lughah al-ʿArabiyyah founded in 1892 and Syria's counterpart established in 1919, spearheaded lexicographical standardization, compiling dictionaries and regulating terminology for fields like mechanics, biology, and governance—efforts that yielded over 100,000 authenticated terms by the mid-20th century. This process preserved Classical Arabic's inflectional system, including nominative-accusative-genitive case markers (iʿrāb) in elevated registers, though practical orthographic conventions increasingly suspend diacritics (tashkīl) in non-Quranic texts to enhance readability, diverging minimally from Classical norms where full vocalization persists in religious exegesis. Native Arabic speakers typically perceive no categorical divide between MSA and Classical Arabic, referring to both as al-lughah al-ʿarabiyyah al-fuṣḥā, with divergences confined largely to lexical innovations rather than structural overhaul.[68][69][70] In practice, MSA dominates written domains—constituting the medium for newspapers, legal codes, and academic discourse—and formal oratory, such as parliamentary debates and Al Jazeera broadcasts, where anchors adhere to its phonemic inventory of 28 consonants (including pharyngeals and emphatics like /ḍ/, /ṭ/, /ṣ/, /ẓ/) and six vowels (/a, i, u/ short and long). Educational systems mandate MSA proficiency from primary levels, with curricula in countries like Saudi Arabia and Morocco allocating 20-30% of instructional time to its mastery, fostering a diglossic continuum wherein learners transition from vernacular acquisition to MSA literacy. Empirical surveys indicate that while MSA comprehension exceeds 90% among educated Arabs for passive exposure (e.g., news consumption), productive fluency wanes below 50% in informal settings due to dialectal interference, underscoring its role as an acquired, high-prestige code rather than a natively spoken vernacular. This diglossia imposes cognitive demands, as evidenced by studies showing slower processing speeds in MSA tasks versus dialects, yet reinforces cultural cohesion by enabling pan-Arab intellectual exchange unbound by regional fragmentation.[71][24][72]Spoken Dialects and Continuum
Spoken Arabic varieties, often termed colloquial or vernacular Arabic, serve as the primary means of everyday oral communication among over 370 million native speakers across the Arab world.[73] These varieties exhibit substantial divergence from Modern Standard Arabic (MSA) in phonology, morphology, syntax, and lexicon, rendering them largely mutually unintelligible with MSA without prior exposure.[74] In diglossic contexts, speakers code-switch between the high-prestige MSA for formal settings and low-prestige dialects for informal interactions, a phenomenon first systematically described in Arabic by Charles Ferguson in 1959.[75] The spoken varieties form a dialect continuum, where linguistic features transition gradually across geographic space, with high mutual intelligibility between neighboring forms but decreasing comprehension over greater distances.[76] For instance, dialects in adjacent regions like urban Syrian Levantine and rural Jordanian Arabic show near-complete intelligibility, while Maghrebi varieties spoken in Morocco differ markedly from Gulf Arabic in eastern Arabia, often requiring interpreters for fluid communication.[77] This continuum arises from historical migrations, trade routes, and substrate influences, preventing rigid boundaries and fostering hybrid bedouin-urban forms in transitional zones.[78] Key divergences include simplified grammatical structures, such as reduced case endings and dual forms absent in many dialects, alongside lexical borrowing from local languages like Berber in the Maghreb or Turkish in Mesopotamian varieties.[79] Phonetic shifts, like the merger of emphatic consonants or loss of interdentals, further distinguish spoken forms, with Egyptian Arabic's media dominance aiding partial comprehension for some listeners despite these variations.[80] Empirical studies on repetition priming reveal cognitive processing challenges between dialects and MSA, underscoring the continuum's impact on language acquisition and bilingualism in Arabic-speaking children.[75]Major Dialect Groups and Regional Variations
Arabic dialects exhibit significant regional variations, broadly classified into five major groups based on geography and linguistic features: Maghrebi, Egyptian, Levantine, Mesopotamian, and Peninsular.[81] These groupings reflect historical migrations, substrate influences, and innovations diverging from Classical Arabic, with mutual intelligibility often limited across groups but higher within them.[82] Bedouin and sedentary varieties further subdivide these, with Bedouin dialects preserving more conservative traits like case distinctions in some contexts. Maghrebi Arabic, spoken in Morocco, Algeria, Tunisia, and Libya, forms the westernmost group and shows heavy influence from Berber languages, including substrate vocabulary and phonology such as the realization of /q/ as [ɡ].[83] Distinctive features include simplified verb conjugations and extensive French loanwords in urban varieties due to colonial history.[84] This group is least mutually intelligible with eastern dialects, often requiring code-switching to Modern Standard Arabic for inter-regional communication.[85] Egyptian Arabic, dominant in Egypt and influencing Sudanese varieties, is characterized by glottal stops for /q/ and widespread media exposure via Egyptian cinema and television since the early 20th century, making it the most understood dialect across the Arab world.[86] Spoken by over 100 million people, it features innovative syntax like periphrastic negation (e.g., "ma...sh") and has absorbed Coptic and Ottoman Turkish elements.[87] Sudanese Arabic, sometimes grouped separately, diverges with Nilotic influences and retains more emphatic consonants.[83] Levantine Arabic encompasses dialects in Syria, Lebanon, Jordan, and Palestine, marked by the merger of short vowels /a/ and /i/ in open syllables and the use of /ʔ/ for /q/ in urban speech.[88] Urban varieties like Damascene and Beirut Arabic show French and Aramaic substrates, while rural and Bedouin forms preserve /g/ for /j/ in some areas.[89] This group benefits from relative homogeneity due to Ottoman-era urbanization. Mesopotamian Arabic, primarily in Iraq and eastern Syria, divides into gilit (urban, /g/ for /j/) and qeltu (Bedouin, /q/ retention) subtypes, with Assyrian and Persian influences evident in lexicon and phonetics like aspirated emphatics.[90] Features include complex aspectual systems and lower mutual intelligibility with peninsular dialects despite proximity. Peninsular Arabic covers the Arabian Peninsula, including Gulf, Najdi, and Yemeni varieties, with conservative traits in southern regions like dual verb forms and tribal-specific jargons.[91] Gulf dialects, spoken in Saudi Arabia, UAE, and Qatar, exhibit /χ/ and /ʁ/ mergers and English loanwords from oil-era globalization, while Yemeni retains archaic case endings in high-register speech.[92] Variations correlate with tribal migrations, with Najdi influencing central Saudi urban centers.[83]Phonology
Consonant Inventory and Emphatics
The consonant phonemes of Classical Arabic, which form the basis for Modern Standard Arabic (MSA), total 28 distinct sounds, encompassing stops, fricatives, affricates, nasals, liquids, and glides.[93] These are articulated across multiple places of articulation, from bilabial to uvular and glottal, with voicing contrasts in most series except glides and the glottal stop /ʔ/.[94] The inventory excludes phonemic /p/ and /v/, which appear only in loanwords, and includes uvular and pharyngeal sounds absent in many Indo-European languages.[95]| Manner/Place | Bilabial | Labiodental | Dental/Alveolar | Emphatic (Pharyngealized) | Postalveolar | Palatal | Velar | Uvular | Pharyngeal | Glottal |
|---|---|---|---|---|---|---|---|---|---|---|
| Stops | b | t, d | ṭ (tˤ), ḍ (dˤ) | k | q | ʔ | ||||
| Fricatives | f | θ, ð, s, z | ṣ (sˤ), ẓ (ðˤ) | ʃ | χ, ʁ | ħ, ʕ | h | |||
| Affricate | ||||||||||
| Nasal | m | n | ||||||||
| Lateral | l | (ɫ in emphatic contexts) | ||||||||
| Rhotic | r (trill) | |||||||||
| Glide | w | j |
Vowel System and Prosody
Modern Standard Arabic (MSA) and Classical Arabic feature a vowel system comprising three short vowels—/a/, /i/, and /u/—and their corresponding long vowels—/aː/, /iː/, and /uː/—with vowel length serving as a phonemic distinction that can alter word meaning, as in kataba (/kataba/, "he wrote") versus kātaba (/kaːtaba/, "he corresponded").[103][104] Long vowels are typically twice the duration of short ones and are orthographically represented by the letters alif (ā), yāʾ (ī), and wāw (ū), while short vowels are indicated by diacritics (fatḥa for /a/, kasra for /i/, ḍamma for /u*) in fully vocalized texts, though these are often omitted in everyday writing.[105][106] Additionally, two diphthongs occur: /aj/ (as in bayt, "house") and /aw/ (as in sawt, "voice"), which may monophthongize to /eː/ and /oː/ in certain contexts or dialects but remain distinct in standard pronunciation.[107][108] Arabic syllable structure is predominantly CV (consonant-vowel) or CVC, with no initial consonant clusters and limited codas, permitting heavy syllables (CVː or CVC) and superheavy syllables (CVVC or CVCC) that influence prosodic patterns.[109] Prosody in MSA is characterized by lexical stress that is predictable and non-phonemic, assigned via right-oriented rules prioritizing syllable weight: stress falls first on a final superheavy syllable (e.g., CVCC), then on the penultimate heavy syllable (CVV or CVC), and finally on the leftmost light syllable (CV) if no heavier syllables precede it, as in madrasa (stress on second syllable) or kitaab (stress on long vowel).[110][111] This quantity-sensitive system aligns with moraic theory, where heavy syllables carry two morae, contributing to rhythmic structure in poetry and recitation, such as in classical ʿarūḍ meters that quantify long syllables as equivalent to two short ones.[112] Intonation in declarative MSA utterances typically exhibits a declining fundamental frequency (F0) contour, with stress realized through increased duration, intensity, and pitch on the stressed syllable, while questions often feature rising intonation at phrase boundaries.[113][114] Prosodic phrasing groups words into accentual units, with boundaries marked by pauses or F0 resets, though variations occur across dialects; for instance, urban varieties may show flatter intonation compared to Bedouin ones.[115] Empirical acoustic studies confirm that stress correlates with 20-50% longer vowel duration in heavy syllables and heightened spectral energy, underscoring the language's reliance on temporal cues over lexical tone.[114] In Quranic recitation and formal speech, prosody adheres closely to these rules to preserve metrical fidelity, differing from casual dialects where vowel reduction or added phonemes can shift patterns.[108]Dialectal Phonetic Divergences
Arabic dialects exhibit substantial phonetic divergences from Classical Arabic (CA) and [Modern Standard Arabic](/page/Modern Standard Arabic) (MSA), primarily in the realization of consonants, driven by regional substrate influences, contact, and internal sound changes. These variations affect uvulars, interdentals, emphatics, and rhotics, often simplifying or altering CA phonemes while maintaining partial intelligibility within dialect continua.[116][117] The uvular stop /q/ in CA shows diverse reflexes across dialects: preserved as in some sedentary varieties like Syrian and Maghrebi; realized as glottal stop [ʔ] in urban Levantine and Egyptian Arabic; shifted to voiced velar stop [ɡ] in Bedouin dialects of the Arabian Peninsula and Egyptian contexts; or to in rural Levantine areas.[116] The affricate /dʒ/ (jīm) varies similarly, retaining [dʒ] in Bedouin Levantine dialects, becoming [ɡ] in Egyptian Arabic, and [ʒ] in urban Levantine and Moroccan varieties.[116][118] Interdental fricatives /θ/, /ð/, and emphatic /ðˤ/ frequently affricate or stop in sedentary dialects: [θ] and [ð] become and in Egyptian and urban Hijazi Arabic (e.g., Mecca, Jeddah), while fricative variants predominate in Bedouin and Najdi dialects of the Arabian Peninsula, with low rates of stop adoption even in contact zones (e.g., 1-12% [t/d] varying by age and gender).[116][119] In northern Mesopotamian dialects, they may shift to [s z].[116] Emphatic consonants, pharyngealized in CA, undergo mergers and shifts: ḍād /dˤ/ realizes as [dˤ] or [zˤ] in sedentary dialects like Cairene, contrasting with [ðˤ] in Bedouin Yemeni varieties; broader emphatic mergers occur, such as /ðˤ/ and /dˤ/ converging to [ðˤ] in some Saudi dialects.[116][120] The rhotic /r/ displays typological splits: plain (tap/trill) and emphatic [rˤ] as distinct phonemes in Maghrebi and Egyptian dialects; emphatic-default /rˤ/ with plain allophones in Levantine; plain-default /r/ with emphatic allophones in Mesopotamian and Peninsular; or uvular [ʁ] contrasting with alveolar in qeltu-Mesopotamian varieties like Mosul Arabic.[121]| Consonant (CA) | Common Dialectal Realizations | Example Dialects |
|---|---|---|
| /q/ | , [ʔ], [ɡ], | Syrian ; Urban Levantine/Egyptian [ʔ]; Bedouin Peninsula [ɡ]; Rural Levantine [116] |
| /dʒ/ | [dʒ], [ɡ], [ʒ] | Bedouin Levantine [dʒ]; Egyptian [ɡ]; Urban Levantine/Moroccan [ʒ][116] |
| /θ ð ðˤ/ | [θ ð ðˤ], [t d dˤ] | Bedouin/Najdi [θ ð ðˤ]; Urban Hijazi/Egyptian [t d dˤ][116][119] |
| /dˤ/ | [dˤ zˤ], [ðˤ] | Cairene [dˤ zˤ]; Yemeni Bedouin [ðˤ][116] |
| /r/ | vs [rˤ]; [ʁ] vs | Maghrebi/Egyptian split; Levantine emphatic-default; Mosul uvular contrast[121] |
Grammar and Morphology
Root-and-Pattern Derivational System
The root-and-pattern system forms the core of Arabic derivational morphology, whereby lexical items are generated by embedding consonantal roots—predominantly triliteral sequences of three consonants encoding a basic semantic field—into predefined templates known as awzān (patterns). These patterns incorporate short vowels, gemination (doubling of consonants), and affixes to yield verbs, nouns, adjectives, and other categories, enabling systematic expansion of vocabulary from a limited set of roots. For instance, the root k-t-b (related to writing) generates kataba ("he wrote," a basic verb), kitāb ("book," a nominal form), kātib ("writer," an active participle), and maktab ("office" or "desk," a locative noun).[122][123] This non-concatenative approach contrasts with Indo-European affixation, prioritizing internal vowel alternations and root intercalation for productivity.[124] Verbal derivation exemplifies the system's efficiency, with triliteral roots typically expanded into ten primary forms (I–X), each imposing a specific pattern and semantic modification such as causativity, reflexivization, or intensification. Form I represents the simplest, unmarked action; Form II often intensifies or causativizes it through gemination of the middle radical; Form IV introduces a prefixed ʾa- for external causation; Forms V–VI add a prefixed ta- for reflexive or reciprocal senses; Forms VII–VIII employ in- or ifta- for passivization or self-directed actions; Form IX, marked by gemination of the final radical, denotes inchoative states like color changes; and Form X uses ista- for desiderative or permissive nuances. Quadriliteral roots yield analogous but rarer forms. The following table outlines the past-tense patterns and typical meanings for the ten forms, using abstract radicals f-ʿ-l:[124]| Form | Past Pattern | Typical Meaning | Example (Root k-t-b) |
|---|---|---|---|
| I | faʿala | Basic action | kataba ("he wrote") |
| II | faʿʿala | Intensive/causative | katta ba ("he made write") |
| III | fāʿala | Reciprocal/associative | kā taba ("he corresponded") |
| IV | ʾafʿala | Causative | ʾaktaba ("he dictated") |
| V | tafaʿʿala | Reflexive of II | takatta ba ("he subscribed") |
| VI | tafāʿala | Reciprocal of III | takā taba ("they wrote to each other") |
| VII | infaʿala | Passive/reflexive | inkataba ("it was written") |
| VIII | iftaʿala | Reflexive/special | iktataba ("he copied") |
| IX | ifʿalla | Inchoative (e.g., color) | (Rare for this root) |
| X | istafʿala | Desiderative/permissive | ista ktaba ("he asked to write") |
Nominal and Adjectival Inflection
Arabic nouns and adjectives in Classical Arabic and Modern Standard Arabic inflect for four primary categories: case, gender, number, and definiteness. Case distinguishes nominative (used for subjects and predicates, marked by ḍammah -u in indefinite singular), accusative (for direct objects and after certain prepositions, marked by fatḥah -a), and genitive (for objects of prepositions and possessed nouns, marked by kasrah -i).[125] These short vowel endings, known as iʿrāb, apply to declinable (muʿrab) nouns and adjectives, while indeclinable (mabnī) forms like certain foreign words or participles lack them.[126] Gender divides nouns into masculine (default for most non-feminine-marked forms) and feminine (often marked by -atun or -ah in singular), with adjectives matching the noun's gender.[127] Number includes singular, dual (formed with -āni nominative, -ayni genitive-accusative for masculine; -atāni, -atayni for feminine), and plural.[128] Plural formation contrasts sound plurals, which add affixes without altering the root, and broken plurals, which involve internal vowel and pattern shifts. Sound masculine plurals use -ūna (nominative indefinite) or -īna (genitive-accusative), while sound feminine plurals employ -ātu (nominative) or -āti (genitive-accusative), typically for nouns ending in -ah.[128] Broken plurals, predominant for non-sound forms, follow over 30 patterns derived from the triconsonantal root system, such as fuʿūl (e.g., jumhūr from raʾīs, "leader" to "leaders"), afʿila (e.g., funūn from fann, "art" to "arts"), or fuʿalāʾ (e.g., ʿulamāʾ from ʿālim, "scholar" to "scholars").[129] These patterns are not fully predictable but cluster by semantic classes, like collectives or diminutives, with approximately 31 productive templates accounting for most occurrences in texts.[129] Definiteness is indicated by the prefix al- for definite forms, which suppresses case endings in pause but retains them in full declension; indefinite uses tanwīn (nunation doubling the case vowel).[127] Adjectives (ṣifa) follow the noun they modify and agree fully in case, gender, number, and definiteness, ensuring concord across the phrase. For instance, indefinite masculine singular kitābun kabīrun ("a big book") shifts to definite al-kitābu al-kabīru, feminine kitābatun kabīratun, or sound masculine plural kitābuna kabīruna.[130] Broken plural adjectives adopt the noun's plural pattern while preserving agreement, as in kutubun kabīratun for feminine-like broken plurals treated as such.[131] This agreement extends to dual and oblique cases, e.g., al-kitābayni al-kabīrayni (genitive dual).[130] In Modern Standard Arabic usage, full case inflection persists in formal writing and recitation, though spoken approximations often drop iʿrāb while retaining gender and number markers for clarity.[127]| Category | Nominative (Indefinite Singular Masculine) | Accusative/Genitive | Example |
|---|---|---|---|
| Case Endings | -un (ḍammah + nūn) | -an/-in (fatḥah/kasrah + nūn) | waladun (boy, nom.); waladan/waladin (acc./gen.)[125] |
| Feminine Marker | -atun | -atan/-atin | bintun (girl, nom.); bintan/bintin[127] |
| Sound Plural (Masc.) | -ūna | -īna | waladūna (boys, nom.); waladina[128] |
| Sound Plural (Fem.) | -ātu | -āti | banātu (girls, nom.); banāti[128] |
Verbal Conjugation and Aspects
Arabic verbs are primarily derived from triconsonantal roots, which consist of three consonants encoding the core semantic content, combined with fixed vowel patterns and affixes to form specific stems known as awzān or forms.[132] These forms, numbered I through X (with additional rare forms), systematically modify the root to express nuances such as causativity, reflexivity, or intensity; for instance, Form I (faʿala) typically denotes the basic action, while Form II (faʿʿala) often indicates intensification or causativity, as in kataba ("he wrote") versus kattaba ("he caused to write").[124] Trilateral roots predominate, though quadriliteral roots exist for some verbs, yielding parallel form series.[133] Verbal conjugation in Arabic distinguishes two primary stems: the perfective, used for completed actions akin to the simple past, and the imperfective, employed for ongoing, habitual, or future actions, thus emphasizing aspect over strict tense.[134] The perfective stem conjugates by suffixation for person, number, and gender—e.g., for the root ktb ("write"), third-person singular masculine kataba ("he wrote"), dual katabā, plural katabū, with feminine forms like katabat—while the imperfective prefixes prefixes like ya-, ta-, or na- and suffixes for similar categories, such as yaktubu ("he writes/is writing").[135] Mood is marked on the imperfective via vowel endings or deletions: indicative (yaktubu), subjunctive (yaktuba), and jussive (yaktub), the latter often for negation or commands.[136] The ten common verb forms exhibit distinct patterns; Form III (fāʿala) suggests reciprocal action (kātaba, "he corresponded with"), Form IV (ʾafʿala) causativity (ʾaktaba, "he dictated"), Form V (tafaʿʿala) reflexivity (takataba, "he subscribed"), Form VI reciprocity (takātabā, "they corresponded"), Form VII inchoativity (inkataba, "it was written"), Form VIII reflexive/causative (ihtakaba, "he hid"), Form IX color/intensification (iḥmarra, "it became red"), Form X requestive (istaktaba, "he asked to write"), with Forms I and II as baselines.[124][137] Weak roots (involving w, y, or hamza) trigger vowel shifts or assimilations, complicating paradigms, as in qāla ("he said") from q-w-l.[138] In Modern Standard Arabic, this system remains robust, but spoken dialects often simplify conjugations—e.g., Levantine prefixes b- to imperfectives for present/future (byaktib, "he writes"), reduces dual forms, or alters negation (ma katab, versus MSA lam yaktub)—while retaining root-form foundations for mutual intelligibility.[139][140] Empirical analyses confirm the aspectual primacy, with perfective denoting bounded events and imperfective unbounded ones, influencing syntax like adverb compatibility.[141]Syntactic Features Across Varieties
Arabic syntactic structures vary considerably between Modern Standard Arabic (MSA), used in formal writing and media, and the diverse spoken dialects, which reflect regional substrate influences and simplification trends over centuries of oral transmission. MSA retains much of Classical Arabic's flexibility, including verb-subject-object (VSO) order as the unmarked literary form, though subject-verb-object (SVO) is increasingly common in contemporary usage for pragmatic emphasis.[142] [143] In contrast, dialects across regions like the Levant, Egypt, and the Gulf predominantly favor rigid SVO order, reducing reliance on case endings for disambiguation and aligning more closely with contact languages such as Aramaic or Turkish.[67] [144] Negation strategies highlight another divergence: MSA employs tense-sensitive particles, such as lā for imperfective verbs, lam inducing jussive mood for past negation, and laysa for nominal predicates, preserving aspectual nuances.[145] Dialects simplify this system, typically prefixing invariant particles like ma or mu to the verb regardless of tense, as in Levantine ma biddī ('I don't want') or Gulf mā katab ('he didn't write'), often contracting with suffixes for past negation (e.g., Egyptian ma katabš).[146] [147] Copular negation in dialects frequently uses forms like mīš or muu, diverging from MSA's laysa and reflecting analogical leveling across verbal and nominal domains.[148] Subject-verb agreement patterns also differ markedly. In MSA, preverbal subjects in SVO trigger full phi-feature agreement (gender, number, person), while postverbal plurals in VSO yield partial agreement—gender but default singular number—constrained by hierarchical feature valuation.[149] [150] Dialects exhibit further reduction, with widespread "deflected" or collective agreement where third-person plural subjects elicit feminine singular verb forms, particularly in perfective tenses, as in Najdi katabū alternating with katbat for 'they (m.) wrote'.[151] This phenomenon, documented in Levantine, Egyptian, and Peninsular varieties, correlates with aspectual marking rather than strict number agreement, indicating a shift toward semantic rather than morphological concord.[152] [153] Relative clause formation underscores regional variation. MSA requires gendered, numbered relative pronouns (alladhī for masculine singular, allatī for feminine), integrating the clause tightly without resumptives for subjects.[154] Spoken dialects innovate with uninflected particles like illi (Levantine, Egyptian) or li (Maghrebi), often inserting resumptive pronouns for oblique or object gaps to aid parsing in the absence of case morphology, as in Syrian il-bint illi šift-ha ('the girl that I saw her').[155] [156] In some Gulf and Bedouin dialects, zero-relativization or adverbial particles prevail, prioritizing economy over explicit marking.[157] These adaptations enhance fluency in rapid speech but challenge mutual intelligibility across dialect continua.[158]Writing System and Orthography
Arabic Script Structure and Direction
The Arabic script employs a cursive structure written from right to left, with letters assuming contextual glyph forms depending on their position in a word: isolated, initial, medial, or final. This shaping facilitates connectivity, as the script is inherently joined-up, allowing most letters to link with adjacent ones for fluid word formation.[159][160] Of the 28 letters in the standard Arabic alphabet, 22 exhibit dual-joining behavior, connecting to both preceding and following letters when possible, while six letters—alif (ا), dāl (د), ذāl (ذ), rāʾ (ر), zāy (ز), and wāw (و)—are right-joining only, refusing connection to the letter on their left (the subsequent one in reading order). This non-joining property creates breaks in cursive flow, affecting word appearance and requiring specific rendering rules in digital systems. Letters are penned starting from the rightmost position, progressing leftward, which aligns with the script's Semitic heritage and optimizes ink flow in traditional nib-based writing.[161][104] In bidirectional text environments, Arabic's right-to-left directionality interacts with left-to-right elements like European numerals or Latin script, which retain their inherent orientation within Arabic spans, necessitating algorithms such as Unicode's bidirectional algorithm for proper display. The baseline remains horizontal throughout, with vertical extensions (ascenders like alif and descenders like final yāʾ) varying by letter to maintain legibility and aesthetic balance in connected sequences.[159][162]Vowel Diacritics and Ambiguities
The short vowels in the Arabic script are represented by optional diacritical marks called harakat (حَرَكَات), which are superimposed on consonants to specify pronunciation. These include fatha (َ) denoting a short /a/ sound as in "fatḥ" (فَتْح), damma (ُ) for /u/ as in "dun" (دُنْ), and kasra (ِ) for /i/ as in "kitāb" (كِتَاب).[163] The sukun (ْ) mark indicates a consonant without a following vowel, preventing elision, while tanwīn variants (ً ٌ ٍ) combine short vowels with nunation for indefinite nouns.[164] Long vowels, by contrast, are typically spelled using matres lectionis such as alif (ا) for /aː/, wāw (و) for /uː/, and yāʾ (ي) for /iː/, without diacritics.[165] Harakat originated in the 8th century CE to standardize Quranic recitation, with systematic application attributed to scholars like Abū al-Aswad al-Duʾalī (d. 688 CE) and later refinements by al-Khalīl ibn Aḥmad (d. 791 CE).[166] In fully vocalized (muṣḥaf) texts, such as Uthmanic Qurans or pedagogical editions, harakat disambiguate morphology and syntax; for example, they distinguish verbal forms like kataba (كَتَبَ, "he wrote") from nominals like kutub (كُتُبْ, "books").[167] However, in everyday prose, journalism, and most printed materials since the medieval period, harakat are routinely omitted, relying on reader familiarity with root-pattern morphology, context, and prosodic cues to infer vowels.[165] This defectively vocalized orthography (rasm) prioritizes skeletal consonants, reflecting the script's abjad nature derived from Nabataean Aramaic.[104] Omission of harakat generates widespread ambiguities, particularly homographic forms (ishtibāk) where identical consonant sequences yield divergent meanings based on vocalization.[168] Lexical ambiguities arise from root derivations; for instance, the skeleton "slm" (سلم) can vocalize as salām (سَلَام, "peace"), sulām (سُلَّام, "ladder"), or sallama (سَلَّمَ, "he handed over").[169] Grammatical ambiguities compound this, as inflectional endings (e.g., case markers ʾiʿrāb) are vowel-dependent and invisible without diacritics, potentially conflating nominative rafʿ (ـُ) with accusative naṣb (َـً). Studies show that unvocalized text activates multiple interpretations, with diacritics reducing heterophonic homograph confusion by up to 20-30% in comprehension tasks, though native speakers resolve most via syntactic context and frequency.[170] In computational linguistics, this necessitates diacritization algorithms, as unvocalized Arabic exhibits morphological ambiguity rates averaging 5-7 possibilities per word form.[171] Such ambiguities pose challenges for non-native learners and early readers, who depend on explicit vocalization in primers (muʿjam or ṣarf texts), but pose minimal issues for fluent speakers attuned to diglossic norms separating unvocalized Modern Standard Arabic (fusha) from dialectal speech.[172] Proposals for mandatory diacritics, as in some 20th-century reform debates (e.g., by Louis Massignon in 1920s Egypt), have failed due to aesthetic, practical, and tradition-bound resistance, preserving the script's conciseness at the cost of initial opacity.[173]Calligraphy, Variants, and Numerals
Arabic calligraphy emerged in the 7th century CE alongside the Quran's revelation, serving as a medium to visually honor sacred texts through disciplined script forms that emphasized proportion and rhythm.[174] Early practitioners adapted pre-Islamic scripts, refining them to suit the Arabic abjad's requirements, with the art gaining prominence due to Islam's aniconic traditions that discouraged figurative representation in religious contexts.[175] Over centuries, it evolved from utilitarian writing to a revered craft, influencing architecture, manuscripts, and decoration across the Islamic world, where scribes like Ibn Muqla (d. 940 CE) standardized proportions based on geometric principles such as the circle and rhombus.[176] The primary styles, or qalam (pen types), include Kufic, an angular script originating in Kufa, Iraq, around the 8th century, characterized by bold, geometric strokes suitable for stone inscriptions and early Quranic codices, though its rigidity limited speed.[177] Naskh, developed later in the 10th century as a cursive alternative, features fluid, rounded forms that enhanced readability and became the standard for printed Arabic texts due to its balance of elegance and practicality.[178] Other variants encompass Thuluth, with elongated verticals for monumental use; Diwani, ornate and slanted for Ottoman decrees; and Ruq'ah, a simplified, rapid style for everyday correspondence.[179] Regional adaptations, such as Maghrebi scripts in North Africa with looped letters, reflect local pen angles and materials, demonstrating how geographic and cultural factors shaped script divergence without altering core phonemic representation.[180] Arabic numerals encompass both historical and contemporary systems. The Abjad numeral system, predating widespread decimal adoption, assigns values to the 28 letters of the Arabic alphabet—alif for 1, ba' for 2, up to ghayn for 1000—facilitating calculations and chronograms in medieval manuscripts, poetry, and astronomy, as seen in works by scholars like al-Biruni (d. 1048 CE).[181] This method, akin to Roman numerals in its alphabetic basis, persisted in esoteric and literary contexts but yielded to positional decimal numerals by the 9th century, when Arabic intermediaries transmitted the Indian zero-based system westward.[182] Modern Eastern Arabic numerals (٠ ١ ٢ ٣ ٤ ٥ ٦ ٧ ٨ ٩), used in most Arab countries, preserve shapes closer to their 9th-century Persian-Arabic forms, differing from Western Arabic numerals (0-9) which evolved through European adaptations starting with Fibonacci's 1202 CE introduction.[183] Originating in India around the 6th century CE, these glyphs entered Arabic scholarship via translations in Baghdad's House of Wisdom, enabling advancements in algebra and trigonometry, though Eastern variants avoided the flattening seen in Latin scripts due to sustained calligraphic influence.[184] In practice, both numeral sets coexist in digital interfaces, with Eastern forms mandatory in contexts like Saudi riyal denominations to maintain cultural continuity.[185]Romanization, Arabizi, and Reform Debates
Romanization of Arabic refers to standardized systems for transcribing the Arabic script into the Latin alphabet, primarily for scholarly, bibliographic, and computational purposes. The American Library Association-Library of Congress (ALA-LC) system, formalized in its 2012 table, renders consonants and vowels with diacritics for precision, such as dh for ذ and ū for long u, while handling the definite article al- without capitalization changes beyond English norms.[186] Other schemes include the United Nations' 2007 romanization, which maps Arabic letters to Latin equivalents like kh for خ, and phonemic approaches like the CJKI Arabic Romanization System (CARS), designed for language learners by prioritizing pronunciation over orthographic fidelity.[187][188] These systems address ambiguities in Arabic orthography, such as unvocalized short vowels, but lack universality, resulting in variant transliterations like Qur'an versus Quran across publications.[189] Arabizi, also termed the Arabic chat alphabet, emerged in the late 1990s amid limited Arabic keyboard support in early internet and SMS technologies, enabling informal transliteration of dialects using Latin letters and numerals to approximate phonemes absent in English.[190] Common substitutions include 2 for hamza (ء), 3 for ʿayn (ع), 5 for khāʾ (خ), 6 for tāʾ (ط), 7 for ḥāʾ (ح), 8 for ghayn (غ), and 9 for qāf (ق), as in rendering "salam" as "slaam" or "shukran" as "shukran" with dialectal tweaks.[191]| Numeral/Letter | Arabic Equivalent | Example Sound |
|---|---|---|
| 2 | ء (hamza) | Glottal stop |
| 3 | ع (ʿayn) | Pharyngeal fricative |
| 5 | خ (khāʾ) | Voiceless velar fricative |
| 6 | ط (ṭāʾ) | Emphatic t |
| 7 | ح (ḥāʾ) | Voiceless pharyngeal fricative |
| 8 | غ (ghayn) | Voiced velar fricative |
| 9 | ق (qāf) | Voiceless uvular stop |
Lexicon and Lexicography
Core Vocabulary and Etymology
Arabic's core vocabulary derives from triconsonantal roots inherited from Proto-Semitic, the common ancestor of Semitic languages originating approximately 5,750 years ago in the Levant region.[199] These roots, typically three consonants encoding a fundamental semantic field, form the basis for deriving nouns, verbs, and adjectives related to essential concepts such as actions, kinship, and natural phenomena, with Arabic preserving many Proto-Semitic forms due to its phonological conservatism.[8] This derivational efficiency allows a single root to generate interconnected terms, as seen in the root k-t-b (marking or inscribing), which produces kataba ("he wrote"), kitāb ("book"), and kātib ("scribe" or "writer").[200][122] Etymologically, core terms often trace directly to Proto-Semitic reconstructions, reflecting shared Semitic heritage while adapting to Arabic's specific sound shifts, such as the retention of gutturals like ḥ and ʿ. For instance, the root r-ḥ-m, denoting compassion or kinship bonds, underlies raḥima ("he had mercy") and raḥim ("womb" or "merciful"), linking familial mercy to maternal origins in a manner consistent with Proto-Semitic semantic extensions.[201] Kinship vocabulary exemplifies this continuity: ʔab ("father") and ʔumm ("mother") align with Proto-Semitic ʔab- (paternal figure) and ʔumm-, terms reconstructed across Akkadian, Hebrew, and Ugaritic cognates.[202] Similarly, baʕl- ("lord" or "master") appears in Arabic as a root for ownership or husbandry, paralleling Proto-Semitic usages in denoting authority or spousal relations.[203] High-frequency roots for basic actions and objects further illustrate Proto-Semitic origins. The root s-l-m, associated with wholeness or peace, yields salām ("peace") and islām ("submission"), with etymological ties to Proto-Semitic šlm for completeness, as evidenced in comparative analyses of Semitic corpora.[204] Numbers and body parts also retain archaic features: yad ("hand") from Proto-Semitic yad-, used instrumentally across Semitic languages, and waḥid ("one") linked to Proto-Semitic ʔaḥad- for unity.[205] This root-based lexicon, comprising over 80% of Classical Arabic's basic lexicon according to morphological surveys, underscores Arabic's role as a conservative repository of Semitic etymological data, though modern dialects introduce variations via substrate influences.[206]Loanwords and Neologisms
Arabic has incorporated loanwords from various languages throughout its history, primarily through phonological and morphological adaptation to fit its triconsonantal root system. In Classical Arabic, borrowings from Persian numbered in the hundreds, including terms like firdaws (paradise) from Middle Persian pairidaēza and jinnī (genie) adapted forms, reflecting cultural exchanges during the Sassanid era before Islam.[207] Greek influences via Syriac intermediaries introduced scientific vocabulary, such as falsafa (philosophy) from philosophia and kīmiyāʾ (alchemy/chemistry) from khēmeia, integrated during the translation movement in Baghdad's House of Wisdom around the 9th century.[207] Aramaic and Syriac contributed administrative and religious terms, like kātib (scribe) variants, due to early Christian and Jewish communities in Arabia.[207] Ottoman Turkish loans entered via administrative rule from the 16th to 19th centuries, particularly in dialects, with examples like qāwūk (cook) from Turkish aşçı, though fewer persisted in Modern Standard Arabic (MSA) due to later purist efforts.[208] In the 19th-20th centuries, European languages influenced MSA amid modernization, yielding direct borrowings such as tilifūn (telephone) from French téléphone and bank (bank) from English, often retaining foreign phonemes while adding Arabic case endings.[209] Dialects exhibit higher loanword density; Levantine Arabic includes English-derived būs (bus) and fīlīm (film), reflecting urbanization and media exposure since the mid-20th century.[210] Neologisms in MSA arise through endogenous processes leveraging the language's morphology, including ishtiqāq (derivation from roots), as in coining ḥāsūb (computer) from the root ḥ-s-b (to calculate) in the 1970s by language academies.[55] Compounding (tarkīb), such as ṭāʾira laḥs (space shuttle, literally "tongue of fire plane"), and semantic extension, where existing roots expand meanings (e.g., intarnat hybridized but often replaced by shabakat al-ʿālam for internet), dominate for technical terms.[211] Loan-translation (tarjamat muḥarrara) and arabization (taʿrīb) adapt foreign concepts, like rendering "email" as barīd iktrōnī (electronic mail) rather than direct īmil, promoted since the 19th-century Nahda revival to preserve linguistic purity.[212] Language academies, established in Cairo (1892), Damascus (1919), and Baghdad (1976), institutionalize neologism approval to counter borrowing proliferation, prioritizing root-based forms over unadapted loans amid globalization pressures.[213] This purism, rooted in 8th-century grammarian traditions, resists full assimilation of terms like English IT jargon, though social media dialects innovate freely with hybrids (e.g., wasāʾil naql jamʿī for public transport apps).[214] Empirical studies indicate arabization succeeds more in formal MSA than colloquial varieties, where English loans comprise up to 10-15% in urban youth speech per sociolinguistic surveys from the 2010s.[210]Historical and Modern Dictionaries
The development of Arabic lexicography originated in the 8th century CE with Al-Khalil ibn Ahmad al-Farahidi's Kitab al-Ayn, recognized as the first dictionary of the Arabic language, completed around 786 CE shortly before the author's death.[215] This work innovatively organized entries by phonetic patterns and rhyme rather than strict alphabetical sequence, beginning with roots featuring the letter ʿayn (ء) to prioritize guttural sounds central to Arabic phonology, and included etymological insights, usage examples from poetry, and definitions drawn from Bedouin informants to capture pre-Islamic vocabulary.[216][217] Subsequent medieval dictionaries expanded on Al-Khalil's foundational method, often compiling from earlier sources to preserve Classical Arabic amid linguistic diversification. Ibn Manzur's Lisan al-Arab, finalized in 1290 CE, stands as one of the most comprehensive, spanning approximately 120,000 entries across 20 volumes in standard editions, with definitions rooted in Quranic verses, hadith, and poetry, emphasizing semantic derivations from triliteral roots while cross-referencing authorities like Al-Khalil.[218] Al-Firuzabadi's Al-Qamus al-Muhit (compiled ca. 1390–1414 CE) condensed vast lexical material into a single-volume reference of over 80,000 words, prioritizing rare terms and dialectical variants to serve scholars, though criticized for occasional omissions of common usages.[219] These works, produced in centers like Baghdad and Cairo, reflected a philological emphasis on fusha (eloquent Arabic) purity, driven by needs to interpret religious texts and counter Persian and Turkish loanword influxes during Abbasid and Mamluk eras.[220] Modern Arabic dictionaries, emerging in the 19th–20th centuries amid nahda (renaissance) reforms and Western scholarly influence, shifted toward Modern Standard Arabic (MSA) while retaining root-based structures for continuity with classical traditions. Hans Wehr's A Dictionary of Modern Written Arabic (first edition 1952, revised 1979) provides a seminal bilingual (Arabic-German/English) resource with over 20,000 root entries, incorporating neologisms, technical terms from science and administration, and contemporary prose examples from newspapers and literature, making it a staple for non-native researchers despite its Eurocentric compilation.[221] Digital initiatives, such as the Hans Wehr online adaptations and projects like ArabicLexicon.Hawramani.com (aggregating 47 classical sources into 229,000+ entries as of 2023), facilitate access but highlight challenges in standardizing MSA against dialectal divergence, with gaps in gender-neutral or colloquial inclusions noted in recent critiques.[217] Efforts in Arab states, including Saudi and Egyptian academies, continue updating lexicons for education, yet reliance on historical corpora persists due to MSA's tethering to Classical norms, limiting full representation of spoken varieties.[222][223]Influence on and from Other Languages
Arabic incorporated numerous loanwords from Aramaic and Syriac, reflecting prolonged contact in the Near East during pre-Islamic and early Islamic times, with examples including religious titles like abbā (father, as in priest) and terms like Iblīs (devil).[224] Syriac influence persisted in Abbasid-era translations, contributing administrative and ecclesiastical vocabulary integrated via phonetic adaptation to Arabic phonology.[207] Greek loans entered primarily through intermediary Aramaic and Middle Persian channels during the Hellenistic period, with around twenty verified terms by the post-Islamic era, expanding in scientific and philosophical domains under the Abbasids.[225] Persian provided borrowings in governance, poetry, and daily life, such as words for fruits and officials, absorbed during Sassanid interactions and Umayyad expansions.[207] Later Ottoman Turkish introduced military and administrative terms, while modern European languages contributed technological neologisms, often arabized to fit root-based morphology.[207] Conversely, Arabic exerted extensive lexical influence on recipient languages through Islamic conquests, trade, and scholarship, embedding terms in religion, science, and commerce. In English, over 100 direct or mediated loans persist, including algebra (from al-jabr, meaning restoration, via medieval mathematical texts), algorithm (from mathematician al-Khwarizmi's name, Latinized as Algoritmi), coffee (from qahwa), sugar (from sukkar), and alcohol (from al-kuḥl, originally a cosmetic powder).[226][227] These entered via Spanish, Italian, or French intermediaries during the Renaissance translation movement from Arabic sources.[228] Spanish absorbed approximately 4,000 Arabic-derived words—constituting about 8% of its modern lexicon—during the nearly 800-year Muslim rule in Al-Andalus (711–1492 CE), with characteristic al- prefixes in terms like alcancía (piggy bank, from al-khazna, treasury) and albaricoque (apricot, from al-barqūq).[229][230] This influence targeted agriculture (aceite, oil, from al-zayt), architecture (azulejo, tile, from al-zulayj), and irrigation (acequia, canal, from al-sāqiya), reflecting practical adaptations in Iberian hydrology and farming.[231] Turkish incorporated thousands of Arabic loans during the Ottoman era (1299–1922 CE), particularly in Islamic jurisprudence, administration, and science, with up to 30% of classical Ottoman vocabulary Arabic-derived, including religious terms like namaz (prayer, adapted from ṣalāh) and abstract concepts like adl (justice).[232] Swahili, via East African trade from the 8th century onward, adopted 15–20% Arabic lexicon, such as kitabu (book, from kitāb) and safari (journey, from safar), integrated into Bantu grammar for commerce and Islam.[233] Persian and Urdu similarly feature heavy Arabic overlays in religious (salām, peace/greeting) and scholarly domains, with Urdu deriving up to 40% of its vocabulary from Arabic-Persian amalgam under Mughal rule (1526–1857 CE).[228] These transfers underscore Arabic's role as a vector for Hellenistic knowledge to Europe and Islamic terminology globally, often unmodified in core phonetics but reshaped syntactically.[228]Usage, Status, and Sociolinguistics
Speaker Demographics and Global Distribution
Arabic is spoken natively by approximately 362 million people worldwide, making it one of the most widely spoken languages by first-language users. Including second-language speakers, primarily those proficient in Modern Standard Arabic (MSA) for religious, educational, or professional purposes, the total number rises to around 422 million. These figures encompass diverse vernacular dialects rather than MSA, which is rarely a native tongue but serves as a lingua franca across Arabic-speaking regions.[234] The vast majority of speakers reside in the Arab world, spanning North Africa and the Middle East, where Arabic holds official status in 22 sovereign states as defined by the Arab League: Algeria, Bahrain, Comoros, Djibouti, Egypt, Iraq, Jordan, Kuwait, Lebanon, Libya, Mauritania, Morocco, Oman, Qatar, Saudi Arabia, Somalia, Sudan, Syria, Tunisia, the United Arab Emirates, and Yemen. Additional countries recognize Arabic as an official or co-official language, including Chad, Eritrea, and Palestine, bringing the total to about 25 nations. In these areas, Arabic speakers constitute over 90% of the population in most cases, though minority languages and dialects coexist. Egypt hosts the largest concentration, with roughly 82 million native speakers, followed by Algeria (40 million), Sudan (28 million), and Saudi Arabia (27 million).[235][236][237]| Country | Estimated Native Speakers (millions) |
|---|---|
| Egypt | 82.4 |
| Algeria | 40.1 |
| Sudan | 28.2 |
| Saudi Arabia | 27.2 |
| Morocco | 25.0 |
| Iraq | 24.7 |
| Yemen | 18.5 |
| Syria | 15.0 |
| Tunisia | 10.9 |
| Jordan | 9.5 |