Hubbry Logo
Nilo-Saharan languagesNilo-Saharan languagesMain
Open search
Nilo-Saharan languages
Community hub
Nilo-Saharan languages
logo
8 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Nilo-Saharan languages
Nilo-Saharan languages
from Wikipedia

Nilo-Saharan
(disputed)
Geographic
distribution
Central Africa, north-central Africa and East Africa
Native speakers
c. 70 million for all branches listed below.[1]
Linguistic classificationProposed language family
Proto-languageProto-Nilo-Saharan
Subdivisions
Language codes
ISO 639-2 / 5ssa
GlottologNone
Distribution of Nilo-Saharan languages (in yellow)

The Nilo-Saharan languages are a proposed family of around 210 African languages[1] spoken by somewhere around 70 million speakers,[1] mainly in the upper parts of the Chari and Nile rivers, including historic Nubia, north of where the two tributaries of the Nile meet. The languages extend through 17 nations in the northern half of Africa: from Algeria to Benin in the west; from Libya to the Democratic Republic of the Congo in the centre; and from Egypt to Tanzania in the east.

As indicated by its hyphenated name, Nilo-Saharan is a family of the African interior, including the greater Nile Basin and the Central Sahara Desert. Most of its proposed constituent divisions are found in the modern countries of Sudan and South Sudan, through which the Nile River flows.

In his book The Languages of Africa (1963), Joseph Greenberg named the group and argued it was a genetic family. It contained all the languages that were not included in the Niger–Congo, Afroasiatic or Khoisan families. Although some linguists have referred to the phylum as "Greenberg's wastebasket", into which he placed all the otherwise unaffiliated non-click languages of Africa,[3][4] other specialists in the field have accepted it as a working hypothesis since Greenberg's classification.[5] Linguists[who?] accept that it is a challenging proposal to demonstrate but contend that it looks more promising the more work is done.[6][7][8]

Some of the constituent groups of Nilo-Saharan are estimated to predate the African neolithic. For example, the unity of Eastern Sudanic is estimated to date to at least the 5th millennium BC.[9] Nilo-Saharan genetic unity would thus be much older still and date to the late Upper Paleolithic. The earliest written language associated with the Nilo-Saharan family is Old Nubian, one of the oldest written African languages, attested in writing from the 8th to the 15th century AD.

Nilo-Saharan is not accepted by all linguists. Glottolog (2013), for example, a publication of the Max Planck Institute in Germany, does not recognise the unity of the Nilo-Saharan family or even of the Eastern Sudanic branch; Georgiy Starostin (2016) likewise does not accept a relationship between the branches of Nilo-Saharan, though he leaves open the possibility that some of them may prove to be related to each other once the necessary reconstructive work is done. According to Güldemann (2018), "the current state of research is not sufficient to prove the Nilo-Saharan hypothesis."[10]

Characteristics

[edit]

The constituent families of Nilo-Saharan are quite diverse. One characteristic feature is a tripartite singulative–collective–plurative number system, which Blench (2010) believes is a result of a noun-classifier system in the protolanguage. The distribution of the families may reflect ancient watercourses in a green Sahara during the African humid period before the 4.2-kiloyear event, when the desert was more habitable than it is today.[11]

Major languages

[edit]

Within the Nilo-Saharan languages are a number of languages with at least a million speakers (most data from SIL's Ethnologue 16 (2009)). In descending order:

Some other important Nilo-Saharan languages under 1 million speakers:

The total for all speakers of Nilo-Saharan languages according to Ethnologue 16 is 38–39 million people. However, the data spans a range from ca. 1980 to 2005, with a weighted median at ca. 1990. Given population growth rates, the figure in 2010 might be half again higher, or about 60 million.

History of the proposal

[edit]

The Saharan family (which includes Kanuri, Kanembu, the Tebu languages, and Zaghawa) was recognized by Heinrich Barth in 1853, the Nilotic languages by Karl Richard Lepsius in 1880, the various constituent branches of Central Sudanic (but not the connection between them) by Friedrich Müller in 1889, and the Maban family by Maurice Gaudefroy-Demombynes in 1907. The first inklings of a wider family came in 1912, when Diedrich Westermann included three of the (still independent) Central Sudanic families within Nilotic in a proposal he called Niloto-Sudanic;[13] this expanded Nilotic was in turn linked to Nubian, Kunama, and possibly Berta, essentially Greenberg's Macro-Sudanic (Chari–Nile) proposal of 1954.

In 1920 G. W. Murray fleshed out the Eastern Sudanic languages when he grouped Nilotic, Nubian, Nera, Gaam, and Kunama. Carlo Conti Rossini made similar proposals in 1926, and in 1935 Westermann added Murle. In 1940 A. N. Tucker published evidence linking five of the six branches of Central Sudanic alongside his more explicit proposal for East Sudanic. In 1950 Greenberg retained Eastern Sudanic and Central Sudanic as separate families, but accepted Westermann's conclusions of four decades earlier in 1954 when he linked them together as Macro-Sudanic (later Chari–Nile, from the Chari and Nile Watersheds).

Greenberg's later contribution came in 1963, when he tied Chari–Nile to Songhai, Saharan, Maban, Fur, and Koman-Gumuz and coined the current name Nilo-Saharan for the resulting family. Lionel Bender noted that Chari–Nile was an artifact of the order of European contact with members of the family and did not reflect an exclusive relationship between these languages, and the group has been abandoned, with its constituents becoming primary branches of Nilo-Saharan—or, equivalently, Chari–Nile and Nilo-Saharan have merged, with the name Nilo-Saharan retained. When it was realized that the Kadu languages were not Niger–Congo, they were commonly assumed to therefore be Nilo-Saharan, but this remains somewhat controversial.

Progress has been made since Greenberg established the plausibility of the family. Koman and Gumuz remain poorly attested and are difficult to work with, while arguments continue over the inclusion of Songhai. Blench (2010) believes that the distribution of Nilo-Saharan reflects the waterways of the wet Sahara 12,000 years ago, and that the protolanguage had noun classifiers, which today are reflected in a diverse range of prefixes, suffixes, and number marking.

Internal relationships

[edit]

Dimmendaal (2008) notes that Greenberg (1963) based his conclusion on strong evidence and that the proposal as a whole has become more convincing in the decades since. Mikkola (1999) reviewed Greenberg's evidence and found it convincing. Roger Blench notes morphological similarities in all putative branches, which leads him to believe that the family is likely to be valid.

Koman and Gumuz are poorly known and have been difficult to evaluate until recently.[vague] Songhay is markedly divergent, in part due to massive influence from the Mande languages.[5] Also problematic are the Kuliak languages, which are spoken by hunter-gatherers and appear to retain a non-Nilo-Saharan core; Blench believes they might have been similar to Hadza or Dahalo and shifted incompletely to Nilo-Saharan.

Anbessa Tefera and Peter Unseth consider the poorly attested Shabo language to be Nilo-Saharan, though unclassified within the family due to lack of data; Dimmendaal and Blench, based on a more complete description, consider it to be a language isolate on current evidence. Proposals have sometimes been made to add Mande (usually included in Niger–Congo), largely due to its many noteworthy similarities with Songhay rather than with Nilo-Saharan as a whole, however this relationship is more likely due to a close relationship between Songhay and Mande many thousands of years ago in the early days of Nilo-Saharan, so the relationship is probably more one of ancient contact than a genetic link.[5]

The extinct Meroitic language of ancient Kush has been accepted by linguists such as Rille, Dimmendaal, and Blench as Nilo-Saharan, though others argue for an Afroasiatic affiliation. It is poorly attested.

With the possible exception of Eastern Sudanic, there is little doubt that the constituent families of Nilo-Saharan—of which only Eastern Sudanic and Central Sudanic show much internal diversity—are valid groups. However, there have been several conflicting classifications in grouping them together. Each of the proposed higher-order groups has been rejected by other researchers: Greenberg's Chari–Nile by Bender and Blench, and Bender's Core Nilo-Saharan by Dimmendaal and Blench. What remains are eight (Dimmendaal) to twelve (Bender) constituent families of no consensus arrangement.

Greenberg 1963

[edit]
The branches of the Nilo-Saharan languages

Joseph Greenberg, in The Languages of Africa, set up the family with the following branches. The Chari–Nile core are the connections that had been suggested by previous researchers.

Nilo‑Saharan

Koman (including Gumuz)

Saharan

Songhay

Fur

Maban

Chari–Nile

Gumuz was not recognized as distinct from neighbouring Koman; it was separated out (forming "Komuz") by Bender (1989).

Bender 1989, 1991

[edit]

Lionel Bender came up with a classification which expanded upon and revised that of Greenberg. He considered Fur and Maban to constitute a Fur–Maban branch, added Kadu to Nilo-Saharan, removed Kuliak from Eastern Sudanic, removed Gumuz from Koman (but left it as a sister node), and chose to posit Kunama as an independent branch of the family. By 1991 he had added more detail to the tree, dividing Chari–Nile into nested clades, including a Core group in which Berta was considered divergent, and coordinating Fur–Maban as a sister clade to Chari–Nile.[14][15]

Bender revised his model of Nilo-Saharan again in 1996, at which point he split Koman and Gumuz into completely separate branches of Core Nilo-Saharan.[16]

Ehret 1989, 2001

[edit]

Christopher Ehret came up with a novel classification of Nilo-Saharan in 1989, though most of the evidence was not published until 2001.[17] His classification, which was not accepted by other researchers,[15] consists of two primary branches: Gumuz–Koman, and a 'Sudanic' group containing the remainder of the families. Unusually, Songhay is nested within a core group and is coordinate with Maban in a 'Western Sahelian' clade, while Kadu is excluded in Nilo-Saharan. Note that 'Koman' in this classification is equivalent to Komuz, i.e. a family with Gumuz and Koman as primary branches, and Ehret renames the traditional Koman group 'Western Koman'.

Nilo‑Saharan
Koman
Sudanic

Central Sudanic

Northern Sudanic

Kunama

Saharo-Sahelian

Saharan

Sahelian

Fur

Trans‑Sahel
Western Sahelian

Eastern Sahelian (Eastern Sudanic) (including Berta)

Bender 2000

[edit]

By 2000 Bender had entirely abandoned the Chari–Nile and Komuz branches. He also added Kunama back to the "Satellite–Core" group and simplified the subdivisions therein. He retracted the inclusion of Shabo, stating that it could not yet be adequately classified but might prove to be Nilo-Saharan once sufficient research has been done. This tentative and somewhat conservative classification held as a sort of standard for the next decade.[18]

Blench 2006

[edit]

Niger-Saharan, a language macrofamily linking the Niger-Congo and Nilo-Saharan phyla, was proposed by Blench (2006).[19] It was not accepted by other linguists. Blench's (2006) internal classification of the Niger-Saharan macrophylum is as follows:

According to Blench (2006), typological features common to both Niger-Congo and Nilo-Saharan include:

  • Phonology: ATR vowel harmony and the labial-velars /kp/ and /gb/
  • Noun-class affixes: e.g., ma- affix for mass nouns in Nilo-Saharan
  • Verbal extensions and plural verbs

Blench 2010

[edit]

With a better understanding of Nilo-Saharan classifiers, and the affixes or number marking they have developed into in various branches, Blench believes that all of the families postulated as Nilo-Saharan belong together. He proposes the following tentative internal classification, with Songhai closest to Saharan, a relationship that had not previously been suggested:

? Mimi of Decorse

Blench 2015

[edit]

By 2015,[20] and again in 2017,[21] Blench had refined the subclassification of this model, linking Maban with Fur, Kadu with Eastern Sudanic, and Kuliak with the node that contained them, and added a tentative, extinct branch he names "Plateau" as to explain a possible Nilo-Saharan substrate in the Malian Dogon and Bangime languages, for the following structure:

Blench (2021) concludes that Maban may be close to Eastern Sudanic.

Starostin (2016)

[edit]
Starostin's "Macro-Sudanic" in purple, surrounding language families shown as well

Georgiy Starostin (2016),[22] using lexicostatistics based on Swadesh lists, is more inclusive than Glottolog, and in addition finds probable and possible links between the families that will require reconstruction of the proto-languages for confirmation. Starostin also does not consider Greenberg's Nilo-Saharan to be a valid, coherent clade.

In addition to the families listed in Glottolog (previous section), Starostin considers the following to be established:

A relationship of Nyima with Nubian, Nara, and Tama (NNT) is considered "highly likely" and close enough that proper comparative work should be able to demonstrate the connection if it's valid, though it would fall outside NNT proper (see Eastern Sudanic languages).

Other units that are "highly likely" to eventually prove to be valid families are:

In summary, at this level of certainty, "Nilo-Saharan" constitutes ten distinct and separate language families: Eastern Sudanic, Central Sudanic – Kadu, Maba–Kunama, Komuz, Saharan, Songhai, Kuliak, Fur, Berta, and Shabo.

Possible further "deep" connections, which cannot be evaluated until the proper comparative work on the constituent branches has been completed, are:

  • Eastern Sudanic + Fur + Berta
  • Central Sudanic – Kadu + Maba–Kunama

There are faint suggestions that Eastern and Central Sudanic may be related (essentially the old Chari–Nile clade), though that possibility is "unexplorable under current conditions" and could be complicated if Niger–Congo were added to the comparison. Starostin finds no evidence that the Komuz, Kuliak, Saharan, Songhai, or Shabo languages are related to any of the other Nilo-Saharan languages. Mimi-D and Meroitic were not considered, though Starostin had previously proposed that Mimi-D was also an isolate despite its slight similarity to Central Sudanic.

In a follow-up study published in 2017, Starostin reiterated his previous points as well as explicitly accepting a genetic relationship between Macro-East Sudanic and Macro-Central Sudanic. Starostin names this proposal "Macro-Sudanic". The classification is as follows.[23]

Starostin (2017) finds significant lexical similarities between Kadu and Central Sudanic, while some lexical similarities also shared by Central Sudanic with Fur-Amdang, Berta, and Eastern Sudanic to a lesser extent.

Dimmendaal 2016, 2019

[edit]

Gerrit J. Dimmendaal[24][25] suggests the following subclassification of Nilo-Saharan:

Nilo‑Saharan

Dimmendaal et al. consider the evidence for the inclusion of Kadu and Songhay too weak to draw any conclusions at present, whereas there is some evidence that Koman and Gumuz belong together and may be Nilo-Saharan.[26]

The large Northeastern division is based on several typological markers:

Blench 2023

[edit]

By 2023,[27] Blench had slightly revised the model for a deep primary split between Koman–Gumuz and the rest. Kunama and Berta are "provisionally" placed as the next to branch off, because they only partially share the features that unite the rest of the family. However, it is not clear if this is because they actually diverged early, or if they might have lost those features at a later date. For example, Berta shares plausible lexical cognates with the Eastern Jebel languages (East Sudanic) and its system of grammatical number "closely resembles" those of the East Sudanic languages; Kunama could be divergent "due to long-term interaction with Afroasiatic languages." Saharan–Songhay (especially Songhay) have seen substantial erosion of key characteristics, but this appears to be a secondary development and not evidence of early branching. "Core" Nilo-Saharan ("Central African" in Blench 2015) thus appears to be a typological rather than genetic grouping, though Maban is treated as a divergent branch of Eastern Sudanic; Kadu also seems to be quite close. The resulting structure is as follows:

Beyond the work of Colleen Ahland, Blench notes that the inclusion of Koman is buttressed by the work of Manuel Otero.[28] The argument for Songhay is mostly lexical, especially the pronouns. Blench gives Greenberg credit for both East and Central Sudanic. Saharan and Songhay have some "striking" similarities in their lexicon, which Blench argues is genetic, though the absence of reliable proto-Sarahan and proto-Songhay reconstructions makes evaluation difficult.

Glottolog 4.0 (2019)

[edit]

In summarizing the literature to date, Hammarström et al. in Glottolog do not accept that the following families are demonstrably related with current research:

External relations

[edit]

Proposals for the external relationships of Nilo-Saharan typically center on Niger–Congo: Gregersen (1972) grouped the two together as Kongo–Saharan. However, Blench (2011) proposed that the similarities between Niger–Congo and Nilo-Saharan (specifically Atlantic–Congo and Central Sudanic) are due to contact, with the noun-class system of Niger–Congo developed from, or elaborated on the model of, the noun classifiers of Central Sudanic.

Phonology

[edit]

Nilo-Saharan languages present great differences, being a highly diversified group. It has proven difficult to reconstruct many aspects of Proto-Nilo-Saharan. Two very different reconstructions of the proto-language have been proposed by Lionel Bender and Christopher Ehret.

Bender's reconstruction

[edit]

The consonant system reconstructed by Bender for Proto-Nilo-Saharan is:

Labial Coronal Palatal Velar
plosive voiceless *t, *t₂ *k, *kʰ
voiced *b *d, *d₂ *g
fricative *f *s
liquid *r, *l *r₂
nasal *m *n
semivowel *w *j

The phonemes /*d₂, *t₂/ correspond to coronal plosives, the phonetic details are difficult to specify, but clearly, they remain distinct from /*d, *t/ and supported by many phonetic correspondences (another author, Cristopher Ehret, reconstructs for the coronal area the sound [d̪], [ḍ] and [t̪], [ṭ] which perhaps are closer to the phonetic detail of /*d₂, *t₂/, see infra)

Bender gave a list of about 350 cognates and discussed in depth the grouping and the phonological system proposed by Ehret. Blench (2000) compares both systems (Bender's and Ehret's) and prefers the former because it is more secure and is based in more reliable data.[29] For example, Bender points out that there is a set of phonemes including implosives /*ɓ, *ɗ, *ʄ, *ɠ/, ejectives /*pʼ, *tʼ, (*sʼ), *cʼ, *kʼ/ and prenasal constants /*ᵐb, *ⁿd, (*ⁿt), *ⁿɟ, *ᵑg/, but it seems that they can be reconstructed only for core groups (E, I, J, L) and the collateral group (C, D, F, G, H), but not for Proto-Nilo-Saharan.

Ehret's reconstruction

[edit]

Christopher Ehret used a less clear methodology and proposed a maximalist phonemic system:

Labial Dental Alveol. Retrof. Palatal Velar Glottal
plosive implosive *ɗ̣
voiced *b *d̪ *d *ḍ *g
voiceless *p *t̪ *t *ṭ *k
aspirate *pʰ *t̪ʰ *tʰ *ṭʰ *kʰ
ejective *pʼ *t̪ʼ *tʼ *ṭʼ *kʼ
fricative *s, *z *ṣ
nasal simple *m *n
prenasal *ⁿb *ⁿð *ⁿd *ⁿḍ *ⁿg
liquid *l̪ *r, *l
approximant plain *w *j
complex *ʼw *ʼj *h

Ehret's maximalist system has been criticized by Bender and Blench. These authors state that the correspondences used by Ehret are not very clear and because of this many of the sounds in the table may only be allophonic variations.[30]

Morphology

[edit]

Dimmendaal (2016)[24] cites the following morphological elements as stable across Nilo-Saharan:

Comparative vocabulary

[edit]

Sample basic vocabulary in different Nilo-Saharan branches:

Note: In table cells with slashes, the singular form is given before the slash, while the plural form follows the slash.

Language eye ear nose tooth tongue mouth blood bone tree water eat name
Proto-Nilotic[31] *(k)ɔŋ, pl. *(k)ɔɲ *yit̪ *(q)ume *kɛ-la(-c) *ŋa-lyɛp *(k)ʊt̪ʊk *käw *kɛ-ɛt, *kɪ-yat *pi(-ʀ) *ɲam *ka-ʀin
Proto-Jebel[32] **ed ~ *er **si(di ~ gi) **ɲi-di **kala-d **udu **k-afa-d **(g-)am- **kaca **cii ~ *kii **ɲam (siigə, saag)
Temein[33] nɪ́ŋɪ̀nàʈ / kɛ̀ɛ́n wénàʈ / kwèén kɪ́mɪ́nʈɪ̀n / kɪkɪ́mɪ́nʈɪ́nɪ̀ awɪ̀s / kɛ́ɛ̀ʔ mɛ́nɖɪnyàʈ íʈùk / k(w)úʈɪ̀n mónɪ̀ʈ àmɪ̀s / kɔ́maʔ mɛ́rɛŋɪ̀s / mɛ́rɛŋ múŋ láma kàlɪ́n, kàlɪ́ŋ
Proto-Daju[34] *aŋune / *aŋwe ~ *aŋun *wunute / *wunuge *mu-ne *ɲiɣte / *ɲiɣke *ɲabire / *ɲabirta *ikke / *ikku *tamuke *ŋai / *ŋayu *ewete / *ewe *ma- *si- *ange / *angu
Kadugli (Talla dialect)[35] ayyɛ / iyyɛ naasɔ / isinɛ́ ámb-/nigáŋg-árɔk t̪- / iŋŋini áŋdáɗuk / ni- niinɔ / niginíínɔ ariid̪ʊ t̪iŋguba / kuba ffa / nááfa ɓiid̪i oori ɛɛrɛ / nigirɛɛnɛ
Proto-Northern Eastern Sudanic[36] *maɲ *ɲog-ul *em-u *ŋes-il *ŋal *ag-il ~ *ag-ul *ug-er *kɛs-ɛr *koɲ-er- *mban *kal- / *kamb- *(ŋ)ɛr-i
Nara[36] no, nòò / no-ta, nóó-ta tús / túsá demmo, dəmmo, dàm̀mò, dòmmò nɪ̀hɪ̀ / nɪ̀hɪ̀t-tá; nèʃɪ̀ / nèʃá hàggà, àggà, ààdà, hàdà aùlò / aùl-lá; àgúrá / àgúr-tà kitto, kɪ̀tò ketti, kəti, kátɪ́ / ketta, kátá tüm, tûm; kè́l emba, mbàà kal, kál, kár ade, ààdà
Proto-Nubian[36] *maaɲ, sg. *miɲ-di *ugul(-e), sg. *ugul-di ? *ŋil, sg. *ŋíl-di *ŋal, sg. ŋal-di *agil *ùg-er *kiser, sg. *kisir-ti *koor, sg. *koor-ti *es-ti *kal- *er-i
Proto-Taman[36] *me-ti, pl. *mVŋ *(ŋ)usu-ti (sg) *eme, sg. emi-ti (sg.) *ŋesi-t(i), pl. *ŋes-oŋ *laat *auli *agi *kei-ti, pl. *kei-ŋ *gaan; *kiɲe(-ti) (?) *kal /*kaal *ŋan- *(ŋ)aat, pl. *(ŋ)ari-g
Proto-Nyima[36] *a̍ŋV *ɲɔgɔr- *(o)mud̪- (?) *ŋil- ? *ŋàl- *wule *amV *t̪uma *bɔ́ŋ *t̪a̍l- / *ta̍m-
Proto-SW Surmic[37] *kɛɓɛrɛ (pl.) *it̪t̪at *ʊŋɛtʃ (?) *ɲiggɪtta *ʌgʌʌt *(k)-ʊt̪t̪ʊk *ɓɪj- *ɛmmɛ *kɛɛt̪ *maam *ɗak- *ðara
Proto-SE Surmic[37] *kabari *ɲabi (?) *giroŋ *ɲigidda (?) *kat *tuk- *ɲaɓa *giga (?) *kɛdo (?) *ma *sara
Proto-Kuliak[38] *ekw, pl. *ekw=ẹk *beos, pl. *beosẹk *nyab, pl. *nyabẹk *ɛd-eɓ *ak, pl. *akẹk *seh *ɔk *ad, pl. *ad=is *kywɛh *yed, pl. *yedẹk
Shabo[39] k’iti sonɑ k’ɑu hɑndɑ kɑusɛ dɑmo emɑhɑ; egege k’ɔnɑ wɔː woŋgɑse
Ongota[40] ˈʔaːfa ˈwoːwa ˈsiːna (loan?) ʔitiˈma ʔɑdabo (loan?) ˈʔiːfa ˈmitʃa (loan?) ˈhɑntʃa ˈtʃaːhawa ʔeˈdʒak ˈmiʃa
Proto-Sara-Bongo-Bagirmi[41] *kamɔ; *kamu; *kama *imbi; *EmbE; *mbili; *mbElE; *imbil-; *EmbEl- *Samɔ; *Samu; *Somu; *kanu; *kunu; *kVnV *kanga; *nganga *unɖɛ(C-) *tara *manga; *masu; *mVsV; *nɖuma *Kinga; *Kunga; *Kingo *kaga *mEnE; *mAnɛ; *mani *OɲO; *ɔɲɔ; *VɲV *iɭi; *ʈV
Proto-Mangbetu[42] *mʷɔ̀ *bɪ́ *amɔ̀ *kɪ́ *kàɖrà *tí(kpɔ̀) *álí *kpɔ̀ *kɪ́rɪ́ɛ̀ *gʷò *láɲɔ̀ *kɛ̀lʊ̀
Mangbutu[43] owékékí ubí tongi usɛ́ kedrú utí koto ikpi okpá uwɛ ano aɓé
Bale[43] ɲɔ̌ ndǔ̱tú̱ da tso kpa tsú wyɔ ngbá / nzú
Ndru[43] nikpɔ́ ɓi(na) ondǐ̹tsǔ̹ ku da tsu âzû kpá ítsú ǐɗá ɲú óvôná
Ma'di (Uganda)[44] ɔ̀mvɔ̄ lɛ̀ɖá ti àrɪ́ hʷa kʷɛ èyí ɲā
Birri[45] mɛ́; mʊ́ nvö; nvu ímɔ̀; ámɔ̀ ìnɖrɔ́; ìnɖrá tyi(di) ɔ́tɔ́ kpɔ kpi; kpɪ wu ɔnyo iri
Kresh[46] mumu mbímbi uŋú ʃɛ́ʃɛ̀ ndjindja srama kpɔkpɔ́ kpikpi ùyù ɔ́ʃɔ́ díri
Dongo[46] mómu mbimbi ʔɔŋu cẹ̀cẹ̀ ndjándja ọọs kpọkpŏ kpikpi ùyù l-ọc(ic) díri
Aja[46] iɲi mimbi múmú uku ndindyi usa gbäbí cící ɓaɓa kiri
Kunama[47] ùkùˈnà bòbòˈnà ŋèeˈlà ùˈdà kòkòˈbà sàŋˈgà èˈlà bìˈà ˈìŋ(à) ˈkíidà
Berta[48] aře iile amúŋ ndu-fuudí hala n'du k’aβa k’aara s’ís’ía fɪ'ri θɪ́ŋa huu (= foot)
Gumuz, Northern[49] kʼwácá tsʼéa ííta kʼósa kʼótʼá sa maχá ʒákwá ɟá aja tsʼéa
Proto-Koman[28] *D̪E *cʼɛ *ʃʊnʃ *ʃE *lEtʼ̪a *tʼ̪wa *sʼámá; *bàs *ʃUImakʼ *cwálá *jiɗE *ʃa; *kʼama *D̪uga
Gule[50] yan ĭgŭn fufŭn ŏdāīān wāīdjo wŏt āī
Gule[51] yan igă̄n fufan adad ayan ĭten ai
Amdang (Kouchane)[52] ni dili, kiliŋgɛ gʊrnɑ kɑlkɑ dɔlː sɪˈmi tʃoː dʊrtu sɔŋ sunu zɑm tʃuluk
Proto-Maba[53] *kàSì-k *dúrmì *sati-k; *sàdí-k / *sadi-ɲi *delemi-k *fàrí-ŋ *ta-k / *ta-si *-aɲɔ- *mílí-ik
Maba[54] kàʃì-k/-ñi koi-k boiñ sati-k delmi-k kan-a/-tu àríi kàñjí-k soŋgo-k inji mílí-i/-síi
Mimi of Decorse[55] dyo feɾ fir ɲain ɲyo su engi ɲyam
Kanuri[56] shîm sə́mò kə́nzà tímì; shélì tə́làm shíllà kə̀ská njî
Zaghawa[57][58] í kέbέ síná màrgiː tàmsiː áá ógú úrú bɛ̀gìdiː sε:gì tír
Dendi[59] háŋŋá nínè hínydyè dɛ́llɛ̀ méè kpííʀì bíʀí túúʀì hàʀí ŋwáà máà
Tadaksahak[60] haŋgá t-í-nʒar ée-ʃan íilǝs míya kud-én biidí tugúdu aryén ŋá mân

See also

[edit]

References

[edit]

Further reading

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
The Nilo-Saharan languages form a proposed comprising approximately 100 to 150 distinct languages spoken by approximately 50 million people across central, eastern, and northeastern . This family, one of the four major linguistic groupings on the continent alongside Afroasiatic, Niger-Congo, and , was first systematically classified by linguist Joseph H. Greenberg in 1963, though its genetic unity remains debated among scholars due to the phylum's internal diversity and morphological complexity. The languages are distributed along the southern River, the extending south to , westward into the , and northward into the regions of Chad, , and . Nilo-Saharan is typically divided into 10 to 12 primary branches, including the largest, Eastern Sudanic (encompassing such as Dinka with around 2 million speakers and Luo with about 6 million), as well as Central Sudanic, Saharan (e.g., Kanuri, spoken by 4 million), Songhay (e.g., Zarma, 3 million speakers), Maban, Kunama, Koman, Berta, Fur, and sometimes Kuliak or unclassified isolates. These branches reflect a fragmented geographic spread, likely tied to historical migrations along ancient river systems and corridors during periods of climatic change in the . Major languages like Maasai (1 million speakers in and ) and Karamojong in highlight the family's concentration in , while Songhay extends influence in the western . Overall, the accounts for linguistic diversity in regions spanning from and in the west to and in the east. Linguistically, Nilo-Saharan languages exhibit significant typological variation, with some featuring verb-initial , others verb-final, and many employing tone to distinguish meaning (e.g., in Zarma, where pitch alters "bi" to mean "yesterday" or "wound"). Common traits include advanced tongue root (ATR) and a singulative-collective number marking where plurals are unmarked and singulars derived (e.g., Karamojong ŋɛɛti "lice" vs. ŋɛɛti-n ""). Historical records, such as writing from the 8th century in the Nile Valley, underscore the family's antiquity, while ongoing research addresses challenges in reconstructing proto-forms amid areal influences from neighboring families.

Overview

Definition and genetic status

The Nilo-Saharan languages form a proposed macro-family of African languages, hypothesized by Joseph H. Greenberg in 1963 to unite diverse groups previously classified under terms like Macro-Sudanic and Chari-Nile based on shared morphological and lexical features. This encompasses approximately 210 languages spoken by around 70 million people (as of 2024), primarily across central, eastern, and parts of western . The genetic status of Nilo-Saharan remains debated, with its unity posited on shared innovations such as verb-final syntax in some branches and recurrent morphemes, yet contested due to limited comparative evidence, potential areal diffusion from neighboring families like Afroasiatic and Niger-Congo, and insufficient regular sound correspondences. Key skeptics, including Gerrit J. Dimmendaal, argue that while subgroups like Eastern Sudanic show coherence, the overall lacks robust proof of common ancestry, leading some to treat it as a typological rather than genetic grouping. The scope of Nilo-Saharan includes core branches such as Songhay (spoken along the ), Saharan (around ), Central Sudanic (in the and ), and the largest, Eastern Sudanic (spanning from Nubian in to Nilotic in ); it also incorporates smaller or isolate-like groups like Koman and Gumuz along the Ethiopia- border. Unclassified or tentatively affiliated languages within the proposed phylum include Shabo in southwestern and the ancient of .

Geographical distribution and demographics

The Nilo-Saharan languages are distributed across northern and , spanning from the valley in and in the west, through the , , the , , and , to , , , , and in the east. This extensive range, covering parts of 17 countries, aligns with major historical watercourses such as the and Chari rivers, which facilitated early settlements and movements. The family includes approximately 210 distinct languages (as of 2024), organized into 8 to 12 primary branches, such as Songhay, Saharan, Central Sudanic, and Eastern Sudanic (including Nilotic). Recent estimates indicate around 70 million speakers in total (as of 2024), with the largest populations concentrated in among Nilotic branches. For example, Dinka has approximately 4.5 million speakers primarily in , while is spoken by roughly 4.2 million people in and . Demographically, speaker communities are diverse, often tied to pastoralist and agricultural lifestyles, with significant concentrations in rural areas along riverine and savanna zones. Historical migration patterns, particularly southward expansions of Nilotic groups from the Sudan region into modern-day Kenya and Tanzania, have been driven by pastoralism and resource availability, shaping current distributions. However, language vitality varies; while major languages like Dinka and Luo remain robust, smaller branches face decline due to assimilation, urbanization, and conflict, with some isolates nearing extinction. The Koman branch, for instance, has an estimated 50,000 to 210,000 speakers across its languages and is considered endangered except for Gumuz, due to intergenerational transmission challenges and external pressures.

Historical development

Early studies and subgroup recognitions

The earliest efforts to identify linguistic unities within the diverse languages of the Nile Valley and surrounding regions date to the mid-19th century. In 1854, German explorer and scholar noted connections between Kanuri and Teda (now part of the Teda-Daza subgroup), marking one of the first recognitions of cohesion within the based on shared vocabulary and grammatical features observed during his travels in . Toward the end of the century, Austrian linguist Friedrich Müller advanced the study by proposing links among what he termed ", including Nubian, Nilotic, and related varieties, in his multi-volume Grundriss der Sprachwissenschaft (1876–1882), where he highlighted morphological and lexical parallels suggesting historical relationships. Early 20th-century scholarship built on these foundations with broader classifications of "Sudanic" languages, often encompassing both Niger-Congo and Nilo-Saharan groups. Diedrich Westermann's Die Sudansprachen (1911) and subsequent works through 1927 grouped languages from to under a Sudanic umbrella, emphasizing tonal systems, noun , and verbal structures as common traits, though without firm genetic ties. Carl Meinhof's Die Sprachen der Hamiten (1912) introduced the , positing a distinct family of "inflecting" languages (including Cushitic and Berber) that contrasted with "prefixing" Sudanic ones, thereby shaping perceptions of non-Bantu African languages as typologically divided rather than related. By the , missionary-linguist Stefano Santandrea identified unity in through detailed grammars, such as his Grammatichetta Giur (1946) for the Jur language, noting shared pronominal systems and verb morphology; he later expanded this work to include Kresh, Aja, and Baka in The Kresh Group, Aja and Baka Languages (Sudan): A Linguistic Contribution (1976). Significant milestones in documentation included the attestation of , the earliest written Nilo-Saharan language, preserved in Christian liturgical and administrative texts from the Kingdom of dating to the 8th through 15th centuries AD, providing invaluable evidence of an Eastern Sudanic branch. A comprehensive synthesis appeared in 1956 with A. N. Tucker and M. A. Bryan's The Non-Bantu Languages of North-Eastern Africa, a handbook that cataloged over 200 languages and proposed the Chari-Nile grouping, uniting Central and Eastern Sudanic varieties based on comparative wordlists and structural resemblances. These pre-1963 investigations largely emphasized areal-typological similarities—such as tone, verb serialization, and head-marking grammar—over demonstrable genetic descent, resulting in fragmented subgroup proposals without an integrated family-level hypothesis until Greenberg's later work.

Greenberg's proposal

In 1963, Joseph H. Greenberg formulated the Nilo-Saharan language family as part of his comprehensive genetic classification of African languages, detailed in his memoir The Languages of Africa, published in the International Journal of American Linguistics. He proposed this phylum by integrating the Songhay languages of the western Sahel, the Saharan languages of the central Sahara, and the Chari-Nile group—previously recognized in earlier studies but now expanded and repositioned—into a unified genetic entity spanning much of northern and eastern Africa. This synthesis marked the first holistic proposal of Nilo-Saharan as a coherent family, encompassing languages from diverse ecological and cultural zones. Greenberg's methodology centered on mass comparison, a technique that systematically surveys extensive lexical and grammatical data across languages to detect patterns of resemblance indicative of common ancestry, rather than relying solely on strict sound laws or limited word lists. He emphasized phonological similarities in roots and affixes, alongside shared morphological patterns such as extensions and systems, to argue for genetic ; this approach yielded numerous proposed cognates, including forms for basic vocabulary like body parts, numerals, and pronouns that recurred across the proposed branches. While not employing formal with percentage-based divergence calculations, his comparisons drew on available dictionaries, field notes, and prior subgroup analyses to build a case for relatedness. The initial structure outlined by Greenberg featured a flat hierarchy with several coordinate branches: Songhay and Saharan as primary units, alongside and Maban (the latter including Mimi and Sara-Bagirmi languages) treated as affiliated families; Koman and Berta were positioned as distinct branches, while the expansive Chari-Nile subgroup incorporated Central Sudanic (e.g., Moru-Madi and Zande clusters) and Eastern Sudanic (including Nilotic and ). This configuration highlighted Nilo-Saharan's internal diversity, with Chari-Nile forming the demographic core due to its numerous Eastern Sudanic varieties. Greenberg's proposal successfully consolidated previously isolated or loosely connected groups into a single , providing a foundational map for African linguistics that facilitated further . However, it faced criticism for its broad, impressionistic strokes—particularly the mass comparison method's vulnerability to chance resemblances and borrowings without rigorous reconstruction of proto-forms or sound laws—which some scholars argued undermined claims of deep genetic unity. Nonetheless, this work remains seminal, shaping all major subsequent investigations into Nilo-Saharan's validity and internal relations.

Classification

Major proposals and models

Following Joseph Greenberg's foundational proposal in 1963, subsequent classifications of the Nilo-Saharan family have refined subgroupings, debated branch inclusions, and incorporated new evidence from , often highlighting structural variations across the . M. Lionel Bender's models, developed between 1989 and 2000, expanded the family by including the -Maban and Kadu languages, which had been variably affiliated in earlier schemes. In his 1996 comparative essay, Bender outlined a core Nilo-Saharan comprising 10 primary branches: Central Sudanic, Eastern Sudanic, Saharan, Songhay, Koman, Gumuz, Berta, Kunama, Kadu, and , emphasizing lexical and morphological correspondences to support their unity. His 1991 classification tree further underscored the internal diversity of Eastern Sudanic, positioning it as a major with multiple subclusters while treating other branches as coordinate. Christopher Ehret's classifications from 1989 to 2001 introduced a more hierarchical, nested structure, diverging from Bender's flatter model by embedding Songhay within a broader Sudanic core and recognizing Gumuz-Koman as a distinct intermediate branch linking western and eastern elements. Ehret's 2001 historical-comparative reconstruction proposed four main divisions—Northern (Saharan and Songhay), Central (Central Sudanic), Eastern (Eastern Sudanic), and Coman (including Gumuz-Koman, Berta, and Kunama)—while noting early Afroasiatic substrate influences on Nilo-Saharan vowel systems and nominal morphology. Roger Blench's work from 2006 onward broadened the discussion by proposing a Niger-Saharan macrophylum, subsuming Niger-Congo as an eastern branch of Nilo-Saharan based on shared features like , labial-velar consonants, and over 30 reconstructed lexical roots such as *bale ('two') and *kulu ('knee'). In publications from 2010 to 2015, Blench strengthened the case for a Saharan-Songhay through aligned pronouns, moveable k- and n- prefixes, and basic vocabulary like 'hand' and 'water', arguing against explanations via borrowing due to limited historical contact. His 2023 analysis defended Nilo-Saharan coherence via three-term number marking and affixes, incorporating preliminary computational to validate morphological patterns across branches. Alternative frameworks include Georgiy Starostin's 2016 lexicostatistical analysis, which posited a "Macro-Sudanic" grouping of 10 families—encompassing Central Sudanic, Eastern Sudanic, Koman, and others—but rejected full genetic unity for Nilo-Saharan due to low retention rates below 15% in deeper comparisons. Gerrit J. Dimmendaal's typological models from 2016 to 2019 emphasized a primary northeastern division, comprising Maban, Kunama, , Tama, and Berta as a conservative with pitch-accent systems and analytic morphology, contrasted against more synthetic central (Central Sudanic) and southern (Nilotic) branches. 4.0 (2019) maintains a conservative stance, accepting core branches like Saharan, Central Sudanic, Eastern Sudanic, and Koman while classifying Songhay, Kadu, and Shabo as unclassified isolates or separate families due to insufficient shared innovations. Central debates revolve around the status of peripheral branches: Songhay's inclusion is contested owing to its atypical verb morphology and potential Berber substrates, leading many to treat it as an independent family; Kadu is tentatively linked via pronouns but lacks robust etymologies; and Shabo remains an with minimal Nilo-Saharan resemblances beyond basic . Computational methods, such as Starostin's automated detection on Swadesh lists, have tested viability—confirming Central Sudanic unity at ~30% retention—but highlight challenges in applying to Nilo-Saharan's time depth exceeding 8,000 years, favoring traditional reconstruction for deeper ties.

Constituent families and branches

The Nilo-Saharan phylum encompasses several primary branches, with classifications typically recognizing 8–12 major families, though the inclusion of certain groups remains debated among linguists. Core branches include Songhay, Saharan, Central Sudanic, and the expansive Eastern Sudanic, alongside smaller ones such as Koman, Gumuz, Berta, Maban, and ; proposals like that of Bender (2000) further incorporate the Kadu languages as a constituent branch. The Songhay branch consists of 10–12 closely related languages spoken across , mainly along the in , , , and . Prominent examples include Songhay (also known as Koyra Chiini) with approximately 1 million speakers and Zarma with over 2 million. Its affiliation with Nilo-Saharan is controversial, as typological features suggest possible historical contact with rather than deep genetic ties. Saharan, a small but coherent branch, comprises about 5 languages distributed in the central of , , , and . Key languages include Kanuri, spoken by around 4 million people primarily in northeastern and , and Teda-Daza in the . This branch is noted for its relative internal stability and verb-final word order. The Central Sudanic branch is one of the larger groups, with over 60 languages spoken in the Chari-Nile basin of , including parts of the Democratic Republic of Congo, , , and . Subgroups feature Moru-Madi (e.g., Moru with about 200,000 speakers) and Zande (around 1 million speakers in the Democratic Republic of Congo and ). Languages in this branch often exhibit complex verb morphology and are geographically concentrated around riverine areas. Eastern Sudanic represents the phylum's most diverse and populous branch, encompassing over 100 languages across sub-branches such as Nilotic, Nubian, Taman, Komanic (sometimes separate), and Surmic. It spans from the Nile Valley in and to eastern in , , , and . The Nilotic sub-branch includes major languages like Dinka (over 2 million speakers in ) and Luo (Dholuo, approximately 4.2 million speakers in and as of recent estimates), reflecting expansions of Eastern Nilotic pastoralist groups over the past millennium. Nubian languages, such as Nobiin (approximately 685,000 speakers along the as of 2024), and Taman (e.g., Tama in ) highlight the branch's northern extent. Smaller branches include Koman, with 5 languages (e.g., Komo and Opo) spoken in southwestern and southeastern ; Gumuz, comprising 2 languages in northwestern ; Berta, with 4 languages along the - border (e.g., Berta proper); Maban, featuring 10 languages in eastern and western (e.g., Maba); and , with 3 closely related languages in western (e.g., with approximately 790,000 speakers as of 2023). The Kadu (or Kadugli-Krongo) languages, numbering about 10 and spoken in southern , are provisionally included in some models but lack robust shared innovations confirming their status. Among unclassified or isolate elements are Shabo, an of southwestern with approximately 400-600 speakers as of 2024, and the extinct , attested in ancient Sudanese inscriptions from the 3rd century BCE to 4th century CE. Extinct branches, such as the Plateau group in , are also documented in historical records. Nilo-Saharan languages collectively have around 60–70 million speakers, with vitality varying widely: robust major languages like Luo, Kanuri, and Dinka support large communities, while many isolates and smaller branches, including Shabo and several Koman varieties, are moribund with declining speaker bases due to assimilation and .

External affiliations

Several hypotheses have been advanced to link the Nilo-Saharan phylum genetically to other major African language families, forming proposed macro-families, though none have achieved scholarly consensus. One prominent proposal is the Niger-Saharan macrophylum, which posits a deep genetic relationship between Nilo-Saharan and Niger-Congo. This hypothesis, developed by Roger Blench in 2006, draws on shared phonological features such as advanced tongue root (ATR) and labial-velar consonants (e.g., /kp/, /gb/), morphological parallels including noun-class affixes like *ma- for mass nouns and verbal extensions, and over 100 lexical etymologies. Notable examples include reconstructed roots for numerals, such as *#bale ("two"), *#naN ("four"), and *#turu ("five"), which exhibit a tripartite structure uncommon elsewhere, as well as pronouns and terms like *#deNe ("") and *#kulu ("knee"). Christopher Ehret's 2001 reconstruction of Proto-Nilo-Saharan provides a comparative framework for the phylum's internal branches and offers etymological , such as over 100 and systematic correspondences in pronouns and basic vocabulary (e.g., first-person singular forms), potentially relevant to broader hypotheses like Niger-Saharan. Blench extends this by including additional Niger-Congo subgroups, proposing that areal contacts in the and facilitated but do not fully explain the resemblances. Speculative ties have also been suggested between Nilo-Saharan and the of , but these lack robust genetic evidence and are generally dismissed as unlikely due to geographic separation and absence of regular correspondences. More plausibly, the ancient of has been classified as part of the Northern East Sudanic branch of Nilo-Saharan, potentially serving as a historical bridge to through shared lexical and morphological traits like verbal derivations. This affiliation, supported by decipherment efforts revealing Nilo-Saharan roots in Meroitic texts from 300 BCE to 400 CE, was further reinforced in 2025 by computational approaches confirming its Eastern Sudanic placement. This underscores possible deep-time connections within the phylum but does not extend to external macro-families. Critics argue that the proposed macro-family links rely heavily on lexical resemblances, such as pronouns and numerals, which may result from chance or ancient borrowing rather than genetic inheritance, and fail to demonstrate consistent sound laws across the diverse groups. Methodological concerns include selective data use and the influence of substrate effects in contact zones like Songhay. As a result, there is no consensus on these hypotheses; most linguists view the similarities as products of areal convergence in rather than shared ancestry, pending further reconstructions.

Evidence of contact and borrowing

Nilo-Saharan languages have experienced extensive areal interactions with neighboring language families, particularly in the and Valley regions, leading to significant lexical borrowing and structural convergence. In the zone, , a of Nilo-Saharan, show heavy influence from Afroasiatic due to historical trade and Islamic expansion, with 1,379 loanwords documented across the Nilo-Saharan , predominantly nouns related to , administration, and daily life. For instance, terms like aluula 'noon prayer' derive directly from al-ʿūlā, transmitted via medieval trade routes from to . Similarly, in the Valley, () borrowed extensively from Coptic and Greek during the Christian period (6th–15th centuries CE), incorporating religious and administrative vocabulary; examples include Old Nubian ⲁⲅⲅⲉⲗⲟⲥ 'angel' from Greek ángelos and ⲟⲣⲡ- 'wine' from Coptic orp-. These borrowings often entered via Coptic intermediaries, reflecting Nubia's position as a cultural crossroads. Contact with Niger-Congo languages is prominent around and in , where like Kanuri have adopted numerous Hausa (Chadic, Afroasiatic) loanwords, serving as intermediaries for broader West African influences. Semantic domains of these borrowings in Kanuri include , , and , with Hausa providing terms that Kanuri speakers integrated through phonological , such as deglottalization and sonorization processes. Pastoral terminology also shows shared vocabulary across Nilo-Saharan and Afroasiatic, such as roots for and herding practices, likely diffused through Cushitic pastoralist migrations into Nilotic areas during the late . In , substrate effects from Ubangi (Niger-Congo) groups have influenced lexical and phonological features, evident in shared terms for riverine and forest resources in contact zones like the of Congo. Areal features, such as the development or reinforcement of tonal systems in some Nilo-Saharan languages, may stem from prolonged contact with Niger-Congo expansions, including Bantu, which introduced tone as a prosodic marker in overlapping regions. These interactions explain superficial typological resemblances sometimes misinterpreted as genetic affiliations, as in proposed macro-family links, and have facilitated language shifts; for example, Northern Songhay varieties exhibit a Berber substratum from non-Tuareg Western , contributing to and grammatical hybridization in nomadic communities. Recent research underscores these dynamics, with Blench (2023) highlighting how Afroasiatic contact eroded core Nilo-Saharan morphological traits in , such as tripartite number marking, resulting in convergent structures like simplified pluralization. No major updates on these contacts have emerged in 2024–2025 studies, as of November 2025.

Phonology

Consonant reconstructions

The reconstruction of the consonant inventory for Proto-Nilo-Saharan remains a central but contested aspect of the family's , with major proposals differing in scope and detail. M. Lionel Bender's 2000 reconstruction posits a relatively conservative system of 18 , featuring a labialized series alongside plain stops and fricatives, but excluding ejectives. The stops include voiceless /p, t, k/ and voiced /b, d, g/, while the fricatives comprise /f, s, h, θ/; this inventory emphasizes bilabial, alveolar, velar, and palatal places of articulation, with as a secondary feature in some series. In contrast, Christopher Ehret's 2001 maximalist reconstruction expands the inventory to over 25 consonants, incorporating glottalized stops such as /p', t', k'/ and uvular fricatives or stops, drawing on more than 300 etymologies to support a richer phonological profile. Ehret's system includes additional series like implosives and ejectives, reflecting innovations or retentions across branches, and posits uvulars (/χ, ʁ/) to account for correspondences in Saharan and . This approach contrasts with Bender's by integrating more areal and subgroup-specific data, though both models agree on core stops /p, t, k, b, d, g/ and nasals /m, n, ŋ/. Key sound correspondences underpin these reconstructions, such as the velar *k reflecting the Proto-Nilo-Saharan form for 'water' (e.g., *ki or variants) across major branches like Nilotic, Central Sudanic, and Saharan. Debates persist over implosives, with Saharan languages retaining /ɓ, ɗ/ as potential archaisms, while Central Sudanic shows them as innovations contrasting with plain stops like /d/, complicating the proto-form assignment. Reconstructing Proto-Nilo-Saharan faces significant challenges, including sparse lexical and phonological data for isolates like Koman or Berta, which limits reliable comparisons. Areal influences from neighboring phyla, such as Afroasiatic or Niger-Congo, further blur putative proto-forms through borrowing and convergence, particularly in fricatives and glottalics. No major updates to these consonant models have emerged since 2023, leaving Bender's and Ehret's frameworks as the primary references despite ongoing refinements in subgroup phonologies.
FeatureBender (2000)Ehret (2001)
Total Consonants1825+
Stops (plain)/p, t, k, b, d, g//p, t, k, b, d, g/
Fricatives/f, s, h, θ//f, s, h, θ, χ/ (uvulars)
Glottalized/EjectivesNone/p', t', k'/
Other SeriesLabialized (e.g., /kʷ/)Implosives (/ɓ, ɗ/), uvulars
BasisComparative across core branches300+ etymologies, subgroup reflexes

Vowel systems and suprasegmentals

The vowel systems of Nilo-Saharan languages exhibit significant diversity across branches, with reconstructions for Proto-Nilo-Saharan reflecting a relatively simple inventory that expanded in daughter languages through processes like . M. Lionel Bender proposed a seven-vowel system for Proto-Nilo-Saharan, consisting of /i, e, a, o, u, , /, where the high central s // and // are considered marginal or derived from earlier schwa-like elements in some branches. In contrast, reconstructed a more elaborate ten- system, including front rounded s such as /y/ and /ø/, alongside the basic five- series /i, e, a, o, u/ and their [+ATR] counterparts, arguing that this structure accounts for reflexes in major subgroups like Eastern Sudanic and Central Sudanic. These reconstructions highlight ongoing debates, as distinctions often blur due to historical mergers and areal pressures. Vowel harmony, particularly advanced tongue root (ATR) harmony, is a prominent feature in many Nilo-Saharan branches, influencing vowel quality across boundaries. In , such as those in the Nilotic and Surmic groups, ATR harmony typically operates as a cross-height system, where [+ATR] vowels (e.g., /i, e, o, u/) trigger harmony in suffixes and affixes, while [-ATR] vowels (e.g., /ɪ, ɛ, ɔ, ʊ/) form an opposing set; this pattern is reconstructed for Proto-Eastern Sudanic and persists in languages like Luo and Didinga. show similar ATR systems in their eastern varieties, with nine or ten vowels participating in harmony (e.g., Mangbetu), though western branches like Sara exhibit reduced systems without full ATR contrasts, often limited to six underlying vowels. Branch-specific traits include voice quality modifications in Central Sudanic, where [-ATR] vowels are frequently realized with breathy or muffled , contrasting with in [+ATR] vowels, a feature linked to areal interactions in the Macro-Sudan Belt. Tone serves as a key suprasegmental feature in most Nilo-Saharan branches, predominantly employing register tones with high and low levels to distinguish lexical and grammatical meanings. Proto-Nilo-Saharan is reconstructed with a basic high tone (*H) as the primary marker, with low tone (*L) emerging secondarily in many descendants, though full systems vary from two-tone (high/low) setups in Central Sudanic (e.g., Bongo-Bagirmi) to three-tone systems with contours in Northeastern Nilo-Saharan. Tone is absent or minimal in Songhay, where stress accent predominates in most varieties (though Dendi shows tonal traces), but robust in Nilotic languages like Luo, which feature level tones with downstep and upstep for morphological contrasts. Downstep, a lowering of high tone after a floating low, is characteristic of Saharan languages (e.g., Kanuri, Beria), contributing to terraced-level effects in phrases. Other suprasegmentals include distinctions and structure constraints, which interact with tone in many languages. is contrastive in Western Nilotic (e.g., Shilluk contrasts short, long, and overlong vowels), often correlating with tonal stability, while Central Sudanic allows phonemic length in open syllables. The typical structure is CV(C), permitting optional codas in closed syllables (e.g., CVC in ), though open CV predominates in Northeastern branches; this template supports tonal anchoring on vowels. Variations arise from substrate influences, such as tonal systems in Central Sudanic potentially shaped by contact with non-tonal , complicating reconstructions and leading to irregular harmony patterns in border areas.

Morphology

Nominal features

Nominal morphology in Nilo-Saharan languages is characterized by a focus on number marking rather than extensive noun class systems typical of neighboring phyla like Niger-Congo. Unlike , where nouns are grouped into concordial classes, Nilo-Saharan noun categorization primarily revolves around inherent semantic number properties, with affixes, tone, and vowel alternations serving to derive singular or plural forms. This system reflects a typological emphasis on and collectivity, often aligning with hierarchies but without obligatory agreement across the . Given the debated genetic unity of the , these features represent proposed shared traits in major classifications. A prominent feature is the tripartite noun classification based on number patterns, widespread in branches like Eastern Sudanic and Nilotic. Nouns fall into three categories: inherently singular (unmarked in singular, suffixed or toned for plural, e.g., mass nouns like 'clay' in some varieties); inherently plural (unmarked in plural, marked with singulatives for singular, common for collectives like 'cattle'); and those requiring marking in both numbers (e.g., via replacive suffixes). In Nilotic languages such as Sengwer, this manifests through affixes like the singular prefix *ke- in Dinka (a variant of *ki- in reconstructions) for derived nouns, contrasting with plural forms using *ka-. This system categorizes nouns semantically to some extent, with humans and animals often showing more individuated marking than inanimates, though the primary driver is number rather than strict semantic classes. Primary number marking in Dinka for basic human or countable nouns involves vowel and tone changes. Gender marking is not widespread across the but appears in specific branches, often limited to two categories: masculine and feminine. In , such as Kanuri, distinctions influence pronominal reference but are less overt on nouns themselves, with masculine forms typically unmarked and feminine marked by suffixes or vowel changes in some varieties. Central Sudanic branches like Koman exhibit clearer nominal , with reconstructed markers such as *-ɗ(i) for masculine and *-ɓ(i) for feminine, applied to human nouns and extending to animates in some cases. Number is more consistently marked phylum-wide, using suffixes (e.g., *-an for plural in like Dongolese), (common in Nilotic for emphasis on plurality), or tone shifts, with plurals often derived from singular bases via or ATR alternations. Nominal derivation from verbs is achieved through dedicated nominalizers, particularly in branches like Koman and Eastern Sudanic, where suffixes convert verbal roots into action nouns or agentives. For instance, in Bilugu Opo (Koman), verb roots form nouns via suffixes like *-Vr for abstract actions, preserving root consonants while altering vowels for nominal status. is a productive strategy across the phylum, especially in Songhay, where exocentric compounds combine nouns to denote possession or attributes, such as naa-líí 'young person' (person-child) or naa-úlum 'calf' (cow-child), often without linking elements. This method expands the without heavy reliance on affixation, contrasting with the suffix-heavy systems in Nilotic. Noun class markers in Nilo-Saharan demonstrate greater stability than verbal morphology, serving as diagnostic traits for genetic subgrouping. Proto-reconstructions include deictic elements like *n- for singular and *k- for , retained in and prefixes across Northeastern Nilo-Saharan (e.g., Moru na/ka, Murle ce-ni/ce-gi), and number suffixes such as *-i, *-in, and *-k, which persist in Nilotic and Central Sudanic despite innovations in verb paradigms. This conservatism highlights as a more reliable locus for historical reconstruction compared to verbs, where valency markers vary widely.

Verbal features

Verbal morphology in Nilo-Saharan languages is characterized by a range of derivational extensions that modify valency and aspect, with reconstructions suggesting proto-forms involving vowel sequences and prefixes like *i-. These extensions include stable suffixes and prefixes for causative, applicative, and passive functions, as proposed in comparative studies of the phylum. For instance, the causative is commonly marked by an *i- prefix across branches such as Central Sudanic and Eastern Sudanic, as in Ma'di (Central Sudanic) where 'climb' becomes ī-tú 'make climb'. In Nubian languages (Eastern Sudanic), causative suffixes like -ir or -gir derive transitive stems, with reflexes such as Nobiin -kir from a proto-Nubian -(i)gir, potentially linked to an archaic Nilo-Saharan causative *i- that shifted to *u- ~ o- in prefixes. Applicative extensions, often increasing valency to include beneficiaries, appear in forms like Nubian -tir or -deen, as in Karko (Nubian) kɔ̄ɔ̄l-ɔ́g ɔ̀kwáá-ɲàn 'build the house for me'. Passive constructions are marked by suffixes in Northeastern Nilo-Saharan, such as Kalenjin (Nilotic) stative/passive endings, and in Nile Nubian languages like Nobiin -dakk or Andaandi -katt, often derived from verbs denoting covering or wrapping. Christopher Ehret's reconstruction of proto-Nilo-Saharan identifies verbal extensions as sequences like *-V- for derivational purposes, evident in East Sudanic branches and supporting phylum-wide coherence. Tense-aspect-mood (TAM) marking in Nilo-Saharan exhibits significant variation, reflecting the phylum's typological diversity and relative instability in these categories. In , TAM is primarily suffixing, as seen in Ngiti where pluractional and aspectual suffixes attach to the stem to indicate repeated or continuous actions. , in contrast, employ prefixing for TAM, with subject and aspect markers preceding the , such as in Alur tense prefixes or Lotuxo a-bak-ne 'I struck you' where a- indicates first-person subject. often use tonal marking for TAM distinctions, as in Beria (Saharan) where tone shifts signal tense-aspect contrasts, or Dinka (Western Nilotic) where tonal inflections encode marked nominative alignment and aspect. This prefixing-suffixing-tonal divide highlights branch-specific developments within the family. Serial verb constructions (SVCs) are a prevalent feature in Nilo-Saharan for forming complex predicates, particularly in Songhay and Saharan branches, where multiple verbs share arguments and TAM to express compounded events without subordination markers. In Songhay languages like Zarma and Koyraboro Senni, SVCs grammaticalize aspect or directionality, as in Koyraboro Senni fur-ganda 'put down' combining motion and placement verbs into a single prosodic unit. Saharan languages employ SVCs and converbs for sequential actions, exemplified in Beria dèī kí-dí-é k-ʊ́ár-ɪ́ 'he put [his] foot into it and turned it over', where verbs chain to denote manner or result. These constructions underscore the phylum's reliance on juxtaposition for syntactic complexity. Branch-specific innovations in verbal morphology include aspectual prefixes in , such as Karko forms distinguishing singular fúr-àŋ g-àà from pluractional tɔ́m-àŋ g-àà, reflecting a shift toward prefixed derivations in Eastern Sudanic. like Kalenjin exhibit extensive agglutinative suffixation for TAM and derivation, as in kee-pal-a:nu:n-é 'to come and dig', combining motion and action. Overall, Nilo-Saharan verbal systems show an agglutinative tendency, with stacked extensions and prefixes/suffixes varying by subgroup, as reconstructed for proto-forms in comparative analyses.

Lexicon

Comparative vocabulary

The comparative vocabulary of Nilo-Saharan languages reveals family resemblances through shared basic across branches, as reconstructed in etymological studies. These cognates, estimated at around 200 reliable forms, support the genetic unity of the despite phonological divergences. Key examples include terms for natural elements and human essentials, where proto-forms exhibit regular sound correspondences, such as initial bilabials or liquids shifting between branches like Central Sudanic and Eastern Sudanic. Basic vocabulary items illustrate core semantic fields. For '', Christopher reconstructs Proto-Nilo-Saharan *mbih or *mbiːh, reflected in cognates like Central Sudanic forms (e.g., Moru mbí). Similarly, the term for '' is *ámā in Ehret's reconstruction, appearing as ama in some Koman languages and bòró in Songhay, highlighting a possible bilabial onset in outliers. These forms underpin arguments for a common , with systematic and tonal patterns preserved across isolates like Songhay. Scholars such as Ehret and Bender have proposed differing reconstructions, reflecting ongoing debate over the phylum's coherence. Body parts provide further evidence of resemblances, often showing consonant correspondences like implosives to stops. Ehret proposes *pɔhin, *bɔhin, or *ɓɔhin for '', cognate with forms such as Nara bɔh and Dinka forms involving nasal elements. For 'head', the proto-form *ɖúːd̪ or *ɖúːɗ (referring to the crown) corresponds to Luo duŋ and Ateso tud, demonstrating retroflex to dental shifts in Nilotic branches. These lexical items, drawn from broad comparative lists, contrast with potential loanwords from Afroasiatic neighbors but align internally via shared suprasegmental features. Cultural terms reflect pastoral adaptations common to many Nilo-Saharan speakers. Ehret reconstructs *pʰeːr for 'cattle' (collective), seen in cognates like Dinka pɛɛr and Nubian variants with aspirated initials, underscoring the phylum's association with herding economies. Such vocabulary, including related terms for livestock, appears in etymological compilations alongside basic lexicon, totaling over 200 proposed cognates that establish scale for reconstruction efforts. Numerals exhibit a tripartite system in some branches, with independent forms for 1-4 and compounds thereafter. Ehret's reconstructions include *ɗéh for 'one' ( with Dinka diɛk), *mbar for 'two' (e.g., Kanuri mbàr), *híno᷅āh for 'three', and *ɔŋwal for 'four', up to *wáyéh for 'ten'. These basic numerals, part of shared etymological sets, show consistent patterns like initial nasals or liquids, aiding subclassification.
NumeralProto-Nilo-Saharan (Ehret 2001)Example Cognates
One*ɗéhDinka diɛk
Two*mbarKanuri mbàr
Three*híno᷅āhLuo adek (shifted), Nara hin
Four*ɔŋwalAteso angal, Nubian anwal
Ten*wáyéhDinka way, Moru wai

Shared etymologies and numerals

One key piece of evidence for the genetic unity of Nilo-Saharan languages is found in their shared numeral systems and select etymologies, which demonstrate regular correspondences across branches. Numeral systems in many Nilo-Saharan languages follow a tripartite structure, with basic forms for 1–4 or 5, compounds for 5–9 (often based on a base like "hand"), and multiples for 10 and above, a pattern distinct from neighboring families such as Afroasiatic or Niger-Congo. For example, subgroup reconstructions include *tu(r)- for "one" in Proto-Saharan (seen in Kanuri tild and Tubu tur), *sV- or *cu- for "two" ( with Maban soen and Koman s(w)iiya), and *a(k)su- for "three" (parallel to Songhay ahaku and Daju *kodos). This structure is unique to Nilo-Saharan and supports deeper relatedness, as proposed in comparative studies of Saharan and other subgroups. Deeper etymologies further bolster this unity, with over 100 reconstructed items proposed by scholars like M. Lionel Bender and , drawing from core vocabulary across branches. Bender's comparative wordlists identify cognates in basic , while Ehret's reconstructs more than 200 proto-forms, including agricultural terms linked to the phylum's origins around the in the Valley and regions, where early and cultivation emerged. These etymologies, particularly those tied to and , align with archaeological evidence of Nilo-Saharan speakers' role in expansions. Methodologically, these reconstructions rely on regular sound laws, such as *p > f in Western Saharan branches (e.g., Kanuri f from proto-Sahara *p in numerals and other items), and have been validated computationally through . George Starostin's 2016 analysis, using automated detection on Swadesh lists, supports the coherence of major subgroups like Central Sudanic and Eastern Sudanic while highlighting weaker links for peripheral branches and overall challenges to unity. However, gaps persist, particularly for isolates like Gumuz and Koman, where etymological evidence is limited to pronouns and basic numerals, and no major new etymologies have emerged since 2023 despite ongoing fieldwork.

References

  1. https://en.wiktionary.org/wiki/Appendix:Proto-Nilo-Saharan_reconstructions
Add your contribution
Related Hubs
User Avatar
No comments yet.