Hubbry Logo
DiphthongDiphthongMain
Open search
Diphthong
Community hub
Diphthong
logo
7 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Diphthong
Diphthong
from Wikipedia

American English pronunciation of "no highway cowboys" /noʊ ˈhaɪweɪ ˈkaʊbɔɪz/, showing five diphthongs: /, , , , ɔɪ/

A diphthong (/ˈdɪfθɒŋ, ˈdɪp-/ DIF-thong, DIP-),[1] also known as a gliding vowel or a vowel glide, is a combination of two adjacent vowel sounds within the same syllable.[2] Technically, a diphthong is a vowel with two different targets: that is, the tongue (and/or other parts of the speech apparatus) moves during the pronunciation of the vowel. In most varieties of English, the phrase "no highway cowboys" (/n ˈhw ˈkbɔɪz/ noh HY-way KOW-boyz) has five distinct diphthongs, one in every syllable.

Diphthongs contrast with monophthongs, where the tongue or other speech organs do not move and the syllable contains only a single vowel sound. For instance, in English, the word ah is spoken as a monophthong (/ɑː/), while the word ow is spoken as a diphthong in most varieties (//). Where two adjacent vowel sounds occur in different syllables (e.g. in the English word re-elect) the result is described as hiatus, not as a diphthong.

Diphthongs often form when separate vowels are run together in rapid speech during a conversation. However, there are also unitary diphthongs, as in the English examples above, which are heard by listeners as single-vowel sounds (phonemes).[3]

The word comes from Ancient Greek δίφθογγος (díphthongos) 'two sounds', from δίς (dís) 'twice' and φθόγγος (phthóngos) 'sound'.

Transcription

[edit]

In the International Phonetic Alphabet (IPA), monophthongs are transcribed with one symbol, as in English sun [sʌn], in which ⟨ʌ⟩ represents a monophthong. Diphthongs are transcribed with two symbols, as in English high /haɪ/ or cow /kaʊ/, in which ⟨⟩ and ⟨⟩ represent diphthongs.

Diphthongs may be transcribed with two vowel symbols or with a vowel symbol and a semivowel symbol. In the words above, the less prominent member of the diphthong can be represented with the symbols for the palatal approximant [j] and the labiovelar approximant [w], with the symbols for the close vowels [i] and [u], or the symbols for the near-close vowels [ɪ] and [ʊ]:

vowel and semivowel haj kaw broad transcription
two vowel symbols hai̯ kau̯
haɪ̯ kaʊ̯ narrow transcription

Some transcriptions are broader or narrower (less precise or more precise phonetically) than others. Transcribing the English diphthongs in high and cow as ⟨aj aw⟩ or ⟨ai̯ au̯⟩ is a less precise or broader transcription, since these diphthongs usually end in a vowel sound that is more open than the semivowels [j w] or the close vowels [i u]. Transcribing the diphthongs as ⟨aɪ̯ aʊ̯⟩ is a more precise or narrower transcription, since the English diphthongs usually end in the near-close vowels ʊ].

The non-syllabic diacritic, the inverted breve below ⟨◌̯⟩,[4] is placed under the less prominent part of a diphthong to show that it is part of a diphthong rather than a vowel in a separate syllable: [aɪ̯ aʊ̯]. When there is no contrastive vowel sequence in the language, the diacritic may be omitted. Other common indications that the two sounds are not separate vowels are a superscript, ⟨aᶦ aᶷ⟩,[5] or a tie bar, ⟨a͡ɪ a͡ʊ⟩ or ⟨a͜ɪ a͜ʊ⟩.[6] The tie bar can be useful when it is not clear which symbol represents the syllable nucleus, or when they have equal weight.[7] Superscripts are especially used when an on- or off-glide is particularly fleeting.[8]

The period ⟨.⟩ is the opposite of the non-syllabic diacritic: it represents a syllable break. If two vowels next to each other belong to two different syllables (hiatus), meaning that they do not form a diphthong, they can be transcribed with two vowel symbols with a period in between. Thus, lower can be transcribed ⟨ˈloʊ.ɚ⟩, with a period separating the first syllable, /l/, from the second syllable, ⟨ɚ⟩.

The non-syllabic diacritic is used only when necessary. It is typically omitted when there is no ambiguity, as in ⟨haɪ kaʊ⟩. No words in English have the vowel sequences *[a.ɪ a.ʊ], so the non-syllabic diacritic is unnecessary.

Types

[edit]

Falling and rising

[edit]

Falling (or descending) diphthongs start with a vowel quality of higher prominence (higher pitch or volume) and end in a semivowel with less prominence, like [aɪ̯] in eye, while rising (or ascending) diphthongs begin with a less prominent semivowel and end with a more prominent full vowel, similar to the [ja] in yard. (Sometimes, however, the terms "falling" and "rising" are used, instead, to refer to vowel height, i.e. as synonyms of the terms "closing" and "opening".[9] See below.) The less prominent component in the diphthong may also be transcribed as an approximant, thus [aj] in eye and [ja] in yard. However, when the diphthong is analysed as a single phoneme, both elements are often transcribed with vowel symbols (/aɪ̯/, /ɪ̯a/). Semivowels and approximants are not equivalent in all treatments, and in the English and Italian languages, among others, many phoneticians do not consider rising combinations to be diphthongs, but rather sequences of approximant and vowel. There are many languages (such as Romanian) that contrast one or more rising diphthongs with similar sequences of a glide and a vowel in their phonetic inventory[10] (see semivowel for examples).

Closing, opening, and centering

[edit]
Vowel diagram illustrating closing diphthongs of Belgian Standard Dutch, from Verhoeven (2005:245)
Vowel diagram illustrating centering diphthongs of the Dutch dialect of Orsmaal-Gussenhoven, from Peters (2010:241)

In closing diphthongs, the second element is more close than the first (e.g. [ai]); in opening diphthongs, the second element is more open (e.g. [ia]). Closing diphthongs tend to be falling ([ai̯]), and opening diphthongs are generally rising ([i̯a]),[11] as open vowels are more sonorous and therefore tend to be more prominent. However, exceptions to this rule are not rare in the world's languages. In Finnish, for instance, the opening diphthongs /ie̯/ and /uo̯/ are true falling diphthongs, since they begin louder and with higher pitch and fall in prominence during the diphthong.

A third, rare type of diphthong that is neither opening nor closing is height-harmonic diphthongs, with both elements at the same vowel height.[12] These may have occurred in Old English:

  • beorht [beo̯rxt] "bright"
  • ċeald [t͡ʃæɑ̯ld] "cold"

A centering diphthong is one that begins with a more peripheral vowel and ends with a more central one, such as [ɪə̯], [ɛə̯], and [ʊə̯] in Received Pronunciation or [iə̯] and [uə̯] in Irish. Many centering diphthongs are also opening diphthongs ([iə̯], [uə̯]).

Diphthongs may contrast in how far they open or close. For example, Samoan contrasts low-to-mid with low-to-high diphthongs:

  • 'ai [ʔai̯] 'probably'
  • 'ae [ʔae̯] 'but'
  • 'auro [ʔau̯ɾo] 'gold'
  • ao [ao̯] 'a cloud'

Narrow and wide

[edit]

Narrow diphthongs are the ones that end with a vowel which on a vowel chart is quite close to the one that begins the diphthong, for example Northern Dutch [eɪ], [øʏ] and [oʊ]. Wide diphthongs are the opposite – they require a greater tongue movement, and their offsets are farther away from their starting points on the vowel chart. Examples of wide diphthongs are RP/GA English [aɪ] and [aʊ].

Length

[edit]

Languages differ in the length of diphthongs, measured in terms of morae. In languages with phonemically short and long vowels, diphthongs typically behave like long vowels, and are pronounced with a similar length.[13] In languages with only one phonemic length for pure vowels, however, diphthongs may behave like pure vowels.[14] For example, in Icelandic, both monophthongs and diphthongs are pronounced long before single consonants and short before most consonant clusters.

Some languages contrast short and long diphthongs. In some languages, such as Old English, these behave like short and long vowels, occupying one and two morae, respectively. Languages that contrast three quantities in diphthongs are extremely rare, but not unheard of; Northern Sami is known to contrast long, short and "finally stressed" diphthongs, the last of which are distinguished by a longer second element.[15]: 149 

Phonology

[edit]

In some languages, diphthongs are single phonemes, while in others they are analyzed as sequences of two vowels, or of a vowel and a semivowel.

Sound changes

[edit]

Certain sound changes relate to diphthongs and monophthongs. Vowel breaking or diphthongization is a vowel shift in which a monophthong becomes a diphthong. Monophthongization or smoothing is a vowel shift in which a diphthong becomes a monophthong.

Difference from semivowels and vowel sequences

[edit]

While there are a number of similarities, diphthongs are not the same phonologically as a combination of a vowel and an approximant or glide. Most importantly, diphthongs are fully contained in the syllable nucleus[16][17] while a semivowel or glide is restricted to the syllable boundaries (either the onset or the coda). This often manifests itself phonetically by a greater degree of constriction,[18] but the phonetic distinction is not always clear.[19] The English word yes, for example, consists of a palatal glide followed by a monophthong rather than a rising diphthong. In addition, the segmental elements must be different in diphthongs [ii̯] and so when it occurs in a language, it does not contrast with [iː]. However, it is possible for languages to contrast [ij] and [iː].[20]

Diphthongs are also distinct from sequences of simple vowels. The Bunaq language of Timor, for example, distinguishes /sa͡i/ [saj] 'exit' from /sai/ [saʲi] 'be amused', /te͡i/ [tej] 'dance' from /tei/ [teʲi] 'stare at', and /po͡i/ [poj] 'choice' from /loi/ [loʷi] 'good'[clarification needed].[21]

Some languages or dialects also articulate the component sounds of a diphthong differently than when those sounds are produced in hiatus. For example, due to English diphthong raising, many North American English speakers pronounce /aɪ/ with closer vowels than /a.ɪ/, and, among a subset of those, the diphthong /aʊ/ may be similarly raised as compared to /a.ʊ/.

Examples

[edit]

Indo-European languages

[edit]

English

[edit]

In words coming from Middle English, most cases of the Modern English diphthongs [aɪ̯, oʊ̯, eɪ̯, aʊ̯] originate from the Middle English long monophthongs [iː, ɔː, aː, uː] through the Great Vowel Shift, although some cases of [oʊ̯, eɪ̯] originate from the Middle English diphthongs [ɔu̯, aɪ̯].

Standard English diphthongs
English
diaphoneme
RP (British) Australian S.W. Irish North American
GenAm Canadian
low ⫽oʊ⫽ [əʉ̯] [ɜʉ̯ ~ ɐʉ̯] [oː] [o̞ʊ̯ ~ ʌʊ̯ ~ ɔʊ̯ ~ o̞]
loud ⫽aʊ⫽ [aʊ̯] [æɔ̯] [ɐʊ̯ ~ ʌʊ̯] [aʊ̯ ~ æʊ̯][t2 1] [aʊ̯ ~ æʊ̯]
lout [ʌʊ̯][t2 2]
lied ⫽aɪ⫽ [ɑ̈ɪ̯ ~ ʌ̞ɪ̯] [ɑ̟e̯ ~ ɑe̯] [æɪ̯ ~ ɐɪ̯] [äɪ̯ ~ ɑɪ̯][t2 3]
light [ʌɪ̯ ~ ɜɪ̯ ~ ɐɪ̯][t2 2]
lay ⫽eɪ⫽ [e̞ɪ̯ ~ ɛɪ̯] [æɪ̯ ~ ɐɪ̯] [eː] [eɪ̯ ~ e]
loin ⫽ɔɪ⫽ [oɪ̯ ~ ɔ̝ɪ̯] [oɪ̯] [əɪ̯ ~ ɑɪ̯] [ɔɪ̯ ~ oɪ̯]
loon ⫽uː⫽[t2 4] [ʊ̈ʉ̯ ~ ʉː ~ ɨ̞ɯ̯̈] [ʊ̈ʉ̯] [uː] [u̟ ~ ʊu̯ ~ ʉu̯ ~ ɵu̯] [ʉu̯]
lean ⫽iː⫽[t2 4] [ɪi̯ ~ iː] [ɪi̯ ~ əi̯] [iː] [ɪi̯ ~ i]
leer ⫽ɪər⫽ [ɪ̞ː ~ ëː ~ ɪə̯] [ɪə̯ ~ ɪː ~ ~ iə̯] [iːɹ] [ɪɹ ~ iɹ]
lair ⫽ɛər⫽ [ɛː ~ ɛə̯] [e̞ː ~ eː] [eːɹ] [ɛɹ]
lure ⫽ʊər⫽ [ɵː ~ ɤ̈ː, o̞ː] [ʊ̈ʉ̯ə, oː] [uːɹ] [ʊɹ ~ ɔɹ ~ oɹ]
  1. ^ In Pittsburgh English, /aʊ̯/ is monophthongal [aː], leading to the stereotypical spelling "Dahntahn" for "downtown".
  2. ^ a b Canadian English and some dialects of northern American English exhibit allophony of /aʊ̯/ and /aɪ̯/ called Canadian raising – in some places they have become separate phonemes. GA has raising to a lesser extent in /aɪ̯/.
  3. ^ In several American dialects such as Southern American English, /aɪ̯/ becomes monophthongal [aː] except before voiceless consonants.
  4. ^ a b The erstwhile monophthongs /iː/ and /uː/ are diphthongized in many dialects. In many cases they might be better transcribed as [uu̯] and [ii̯], where the non-syllabic element is understood to be closer than the syllabic element. They are sometimes transcribed /uw/ and /ij/.

Dutch

[edit]
Diphthongs of Dutch
Netherlandic[22] Belgian[23]
zeis, ijs [ɛɪ̯]
ui [œʏ̯]
zout, louw [ɑʊ̯] [ɔʊ̯]
leeuw [eːʊ̯]
nieuw [iʊ̯]
duw [yʊ̯]
dooi [o:ɪ̯]
saai [a:ɪ̯]
loei [uɪ̯]
beet[t1 1] [eɪ̯] [eː]
neus[t1 1] [øʏ̯] [øː]
boot[t1 1] [oʊ̯] [oː]
  1. ^ a b c [eɪ̯], [øʏ̯], and [oʊ̯] are normally pronounced as closing diphthongs except when preceding [ɾ] and even [ɫ], in which case they are either centering diphthongs: [eə̯], [øə̯], and [oə̯] or are lengthened and monophthongized to [ɪː], [ʏː], and [ʊː]

The dialect of Hamont (in Limburg) has five centring diphthongs and contrasts long and short forms of [ɛɪ̯], [œʏ̯], [ɔʊ̯], and [ɑʊ̯].[24]

Afrikaans

[edit]

The Afrikaans language has its origin in Dutch but differs in many significant ways, including the use of diphthongs in the place of several non-diphthong Dutch double vowels, or double-vowels being pronounced differently. Examples include:

  • ee as in leer
  • eu as in deur
  • ui as in buite
Diphthong phonemes[25][26]
Starting point Ending point
Front Central Back
Closed unrounded iʊ̯
rounded uɪ̯
Mid unrounded əɪ̯ ɪə
rounded œɪ̯, ɔɪ̯, oːɪ̯ ʏə, ʊə œʊ̯
Open unrounded aɪ̯, ɑːɪ̯
  • Falling diphthongs. Their first element may be short [ɪə̯, ʊə̯] or somewhat lengthened [ɪˑə̯, ʊˑə̯].[26]
  • Rising diphthongs [ɪ̯ə, ʊ̯ə]. These variants do not seem to appear word-finally. The sequence /ɦʊə/ is commonly realised as [ɦʊ̯ə] or, more often, [ɦʊ̯ə̤], with /ɦ/ realised as breathy
  • The scholar Daan Wissing argues that /əɪ̯/ is not a phonetically correct transcription and that /æɛ̯/ is more accurate. In his analysis, he found that [æɛ̯] makes for 65% of the realisations, the other 35% being monophthongal, [ə], [æ] and [ɛ].[27]
  • Most often, /œɪ̯/ has an unrounded offset. For some speakers, the onset is also unrounded. That can cause /œɪ̯/ to merge with /əɪ̯/, which is considered non-standard.[28]
  • /ɔɪ̯, aɪ̯/ occur mainly in loanwords.[28]
  • Older sources describe /œu/ as a narrow back diphthong [ou].[29][30] However, newer sources describe its onset as more front. For example, Lass (1984), states that the onset of /œu/ is central [ɵu].[31]
    • In some words which, in English, are pronounced with /əʊ̯/, the Afrikaans equivalent tends to be pronounced with /œʊ̯/, rather than /ʊə/. That happens because Afrikaans /œʊ̯/ is more similar to the usual South African realization of English /əʊ̯/.[29]
Example words for diphthongs
Phoneme IPA Orthography Gloss
/ɪø/ /sɪøn/ seun 'son'
/əɪ̯/ /ɦəɪ̯/ hy 'he'
/ɪə/ /vɪət/ weet 'to know'
/œɪ̯/ /ɦœɪ̯s/ huis 'house'
/ɔɪ̯/ /ˈχɔɪ̯əŋ/ goiing 'burlap'
/ʊə/ /brʊət/ brood 'bread'
/œʊ̯/ /kœʊ̯t/ koud 'cold'
/aɪ̯/ /ˈbaɪ̯ə/ baie 'many'

The long diphthongs (or 'double vowels') are phonemically sequences of a free vowel and a non-syllabic equivalent of /i/ or /u/: [iu, ui, oːi, eu, ɑːi]. Both [iu] and [eu] tend to be pronounced as [iu], but they are spelled differently: the former as ⟨ieu⟩, the latter as ⟨eeu⟩.[32]

In diminutives ending in /ki/ formed to monosyllabic nouns, the vowels /u, ɪə, ʊə, ɛ, ə, œ, ɔ, a, ɑː/ are realised as closing diphthongs [ui, ei, oi, ɛi, əi, œi, ɔi, ai, ɑːi]. In the same environment, the sequences /ɛn, ən, œn, ɔn, an/ are realized as [ɛiɲ, əiɲ, œiɲ, ɔiɲ, aiɲ], i.e. as closing diphthongs followed by palatal nasal.[33]

  • The suffixes ⟨-aad⟩ and ⟨-aat⟩ (phonemically /ɑːd/ and /ɑːt/, respectively) and the diminutive suffix /ki/ are realised as [ɑːki] (with a monophthong), rather than [ɑːiki].[28]
  • In practice, the diphthong [əi] is realised the same as the phonemic diphthong /əi/.[34]
  • [œi], when it has arisen from diphthongisation of [œ], differs from the phonemic diphthong /œi/ by having a slightly different onset, although the exact nature of that difference is unclear. This means that puntjie 'point' sounds somewhat different from puintjie 'rubble'.[34]

German

[edit]
Standard German
[edit]

Phonemic diphthongs in German:

  • /aɪ̯/ as in Ei 'egg'
  • /aʊ̯/ as in Maus 'mouse'
  • /ɔʏ̯/ as in neu 'new'

In the varieties of German that vocalize the /r/ in the syllable coda, other diphthongal combinations may occur. These are only phonetic diphthongs, not phonemic diphthongs, since the vocalic pronunciation [ɐ̯] alternates with consonantal pronunciations of /r/ if a vowel follows, cf. du hörst [duː ˈhøːɐ̯st] 'you hear' – ich höre [ʔɪç ˈhøːʀə] 'I hear'. These phonetic diphthongs may be as follows:

German diphthongs ending in [ɐ̯] (part 1), from Kohler (1999:88)
German diphthongs ending in [ɐ̯] (part 2), from Kohler (1999:88)
Diphthong Example
Phonemically Phonetically IPA Orthography Translation
/iːr/ [iːɐ̯]1 [viːɐ̯] wir we
/yːr/ [yːɐ̯]1 [fyːɐ̯] für for
/uːr/ [uːɐ̯]1 [ˈʔuːɐ̯laʊ̯pʰ] Urlaub holiday
/ɪr/ [ɪɐ̯] [vɪɐ̯tʰ] wird he/she/it becomes
/ʏr/ [ʏɐ̯] [ˈvʏɐ̯də] Würde dignity
/ʊr/ [ʊɐ̯] [ˈvʊɐ̯də] wurde I/he/she/it became
/eːr/ [eːɐ̯]1 [meːɐ̯] mehr more
/øːr/ [øːɐ̯]1 [høːɐ̯] hör! (you) hear!
/oːr/ [oːɐ̯]1 [tʰoːɐ̯] Tor gate/goal (in football)
/ɛːr/ [ɛːɐ̯]1 [bɛːɐ̯] Bär bear
/ɛr/ [ɛɐ̯] [ʔɛɐ̯ftʰ] Erft Erft
/œr/ [œɐ̯] [dœɐ̯tʰ] dörrt he/she/it dries
/ɔr/ [ɔɐ̯] [ˈnɔɐ̯dn̩] Norden north
/aːr/ [aːɐ̯]1 [vaːɐ̯] wahr true
/ar/ [aɐ̯] [haɐ̯tʰ] hart hard
^1 Wiese (1996) notes that the length contrast is not very stable before non-prevocalic /r/[35] and that "Meinhold & Stock (1980:180), following the pronouncing dictionaries (Mangold (1990), Krech & Stötzer (1982)) judge the vowel in Art, Schwert, Fahrt to be long, while the vowel in Ort, Furcht, hart is supposed to be short. The factual basis of this presumed distinction seems very questionable."[35][36] He goes on stating that in his own dialect, there is no length difference in these words, and that judgements on vowel length in front of non-prevocalic /r/ which is itself vocalized are problematic, in particular if /a/ precedes.[35]
According to the 'lengthless' analysis, the aforementioned 'long' diphthongs are analyzed as [iɐ̯], [yɐ̯], [uɐ̯], [eɐ̯], [øɐ̯], [oɐ̯], [ɛɐ̯] and [aɐ̯]. This makes non-prevocalic /aːr/ and /ar/ homophonous as [aɐ̯] or [aː]. Non-prevocalic /ɛːr/ and /ɛr/ may also merge, but the vowel chart in Kohler (1999:88) shows that they have somewhat different starting points.
Wiese (1996) also states that "laxing of the vowel is predicted to take place in shortened vowels; it does indeed seem to go hand in hand with the vowel shortening in many cases."[35]
Bernese German
[edit]

The diphthongs of some German dialects differ from standard German diphthongs. The Bernese German diphthongs, for instance, correspond rather to the Middle High German diphthongs than to standard German diphthongs:

  • /iə̯/ as in lieb 'dear'
  • /uə̯/ as in guet 'good'
  • /yə̯/ as in müed 'tired'
  • /ei̯/ as in Bei 'leg'
  • /ou̯/ as in Boum 'tree'
  • /øi̯/ as in Böim 'trees'

Apart from these phonemic diphthongs, Bernese German has numerous phonetic diphthongs due to L-vocalization in the syllable coda, for instance the following ones:

  • [au̯] as in Stau 'stable'
  • [aːu̯] as in Staau 'steel'
  • [æu̯] as in Wäut 'world'
  • [æːu̯] as in wääut 'elects'
  • [ʊu̯] as in tschúud 'guilty'

Yiddish

[edit]

Yiddish has three diphthongs:[37]

  • [ɛɪ̯] as in [plɛɪ̯tə] פּליטה ('refugee' f.)
  • [aɛ̯] as in [naɛ̯n] נײַן ('nine')
  • [ɔə̯] as in [ɔə̯fn̩] אופֿן ('way')

Diphthongs may reach a higher target position (towards /i/) in situations of coarticulatory phenomena or when words with such vowels are being emphasized.

Norwegian

[edit]

There are five diphthongs in the Oslo dialect of Norwegian, all of them falling:

  • [æɪ] as in nei, "no"
  • [œʷʏʷ] as in øy, "island"
  • [æʉ͍] as in sau, "sheep"
  • [ɑɪ] as in hai, "shark"
  • [ɔʷʏʷ] as in joik, "Sami song"

An additional diphthong, [ʉ͍ɪ], occurs only in the word hui in the expression i hui og hast "in great haste". The number and form of diphthongs vary between dialects.

Faroese

[edit]

Diphthongs in Faroese are:

  • /ai/ as in bein (can also be short)
  • /au/ as in havn
  • /ɛa/ as in har, mær
  • /ɛi/ as in hey
  • /ɛu/ as in nevnd
  • /œu/ as in nøvn
  • /ʉu/ as in hús
  • /ʊi/ as in mín, , (can also be short)
  • /ɔa/ as in ráð
  • /ɔi/ as in hoyra (can also be short)
  • /ɔu/ as in sól, ovn

Icelandic

[edit]

Diphthongs in Icelandic are the following:

  • /au̯/ as in átta, "eight"
  • /ou̯/ as in nóg, "enough"
  • /øi̯/ as in auga, "eye"
  • /ai̯/ as in kær, "dear"
  • /ei̯/ as in þeir, "they"
  • /ɔi/ as in koja, "bunk bed", "berth" (rare, only in handful of words)

Combinations of semivowel /j/ and a vowel are the following:

  • /jɛ/ as in éta, "eat"
  • /ja/ as in jata, "manger"
  • /jau̯/ as in , "yes"
  • /jo/ as in joð, "iodine", "jay", "yod" (only in a handful of words of foreign origin)
  • /jou̯/ as in jól, "Christmas"
  • /jœ/ as in jötunn, "giant"
  • /jai̯/ as in jæja, "oh well"
  • /ju/ as in , "yes"

French

[edit]

In French, /wa/, /wɛ̃/, /ɥi/ and /ɥɛ̃/ may be considered true diphthongs (that is, fully contained in the syllable nucleus: [u̯a], [u̯ɛ̃], [y̯i], [y̯ɛ̃]). Other sequences are considered part of a glide formation process that turns a high vowel into a semivowel (and part of the syllable onset) when followed by another vowel.[38]

Diphthongs

  • /wa/ [u̯a] as in roi "king", coi "quiet", croix "cross", doigt "finger", droit "right", foi "faith", loi "law", proie "prey", soie "silk", toit "roof", voie "way", voix "voice"
  • /wɛ̃/ [u̯ɛ̃] as in groin "muzzle", coin "corner", coing "quince", foin "hay", moins "less", loin "far", point "point", poing "fist", soin "care"
  • /ɥi/ [y̯i] as in huit "eight", bruit "noise", buis "boxwood", fruit "fruit", fuite "leak", luire "glow", nuire "harm", nuit "night", pluie "rain", suite "continuation"
  • /ɥɛ̃/ [y̯ɛ̃] as in juin "June", suint "ooze (tar)"

Semivowels

  • /wi/ as in oui "yes"
  • /jɛ̃/ as in lien "bond"
  • /jɛ/ as in Ariège
  • /je/ as in pied "foot"
  • /ji/ as in yin
  • /aj/ as in travail "work"
  • /ɛj/ as in Marseille
  • /ij/ as in bille "ball"
  • /œj/ as in feuille "leaf"
  • /uj/ as in grenouille "frog"
  • /jø/ as in vieux "old"
Quebec French
[edit]

In Quebec French, long vowels are generally diphthongized in informal speech when stressed.

  • [ɑɔ̯] as in tard "late"
  • [aɛ̯] as in père "father"
  • [aœ̯] as in fleur "flower"
  • [ou̯] as in autre "other"
  • [øy̯] as in neutre "neutral"
  • [ãʊ̯̃] as in banque "bank"
  • [ẽɪ̯̃] as in mince "thin"
  • [ɒ̃ʊ̯̃] as in bon "well"
  • [œ̃ʏ̯̃] as in un "one"

Catalan

[edit]

Catalan possesses a number of phonetic diphthongs, all of which begin (rising diphthongs) or end (falling diphthongs) in [j] or [w].[39]

Catalan diphthongs
falling
[aj] aigua 'water' [aw] taula 'table'
[əj] mainada 'children' [əw] caurem 'we will fall'
[ɛj] remei 'remedy' [ɛw] peu 'foot'
[ej] rei 'king' [ew] seu 'his/her'
[iw] niu 'nest'
[ɔj] noi 'boy' [ɔw] nou 'new'
[ow] jou 'yoke'
[uj] avui 'today' [uw] duu 'he/she is carrying'
rising
[ja] iaia 'grandma' [wa] quatre 'four'
[jɛ] veiem 'we see' [wɛ] seqüència 'sequence'
[je] seient 'seat' [we] ungüent 'ointment'
[jə] feia 'he/she was doing' [wə] qüestió 'question'
[wi] pingüí 'penguin'
[jɔ] iode 'iodine' [wɔ] quota 'payment'
[ju] iogurt 'yoghurt'

In standard Eastern Catalan, rising diphthongs (that is, those starting with [j] or [w]) are possible only in the following contexts:[40]

  • [j] in word initial position, e.g. iogurt.
  • Both occur between vowels as in feia and veiem.
  • In the sequences [ɡw] or [kw] and vowel, e.g. guant, quota, qüestió, pingüí (these exceptional cases even lead some scholars[41] to hypothesize the existence of rare labiovelar phonemes /ɡʷ/ and /kʷ/).[42]

There are also certain instances of compensatory diphthongization in the Majorcan dialect so that /ˈtroncs/ ('logs') (in addition to deleting the palatal plosive) develops a compensating palatal glide and surfaces as [ˈtrojns] (and contrasts with the unpluralized [ˈtronʲc]). Diphthongization compensates for the loss of the palatal stop (part of Catalan's segment loss compensation). There are other cases where diphthongization compensates for the loss of point of articulation features (property loss compensation) as in [ˈaɲ] ('year') vs [ˈajns] ('years').[43] The dialectal distribution of this compensatory diphthongization is almost entirely dependent on the dorsal plosive (whether it is velar or palatal) and the extent of consonant assimilation (whether or not it is extended to palatals).[44]

Portuguese

[edit]

The Portuguese diphthongs are formed by the labio-velar approximant [w] and palatal approximant [j] with a vowel,[45] European Portuguese has 14 phonemic diphthongs (10 oral and 4 nasal),[46] all of which are falling diphthongs formed by a vowel and a nonsyllabic high vowel. Brazilian Portuguese has roughly the same amount, although the European and non-European dialects have slightly different pronunciations ([ɐj] is a distinctive feature of some southern and central Portuguese dialects, especially that of Lisbon). A [w] onglide after /k/ or /ɡ/ and before all vowels as in quando [ˈkwɐ̃du] ('when') or guarda [ˈɡwaɾðɐ ~ ˈɡwaʁdɐ] ('guard') may also form rising diphthongs and triphthongs. Additionally, in casual speech, adjacent heterosyllabic vowels may combine into diphthongs and triphthongs or even sequences of them.[47]

Falling diphthongs of Portuguese
oral
EP[46] BP EP BP
sai [aj] mau [aw]
sei [ɐj]/[ej] [ej] meu [ew]
anéis [ɛj] véu [ɛw]
viu [iw]
mói [ɔj]
moita [oj] dou [ow]
fui [uj]
nasal
mãe [ɐ̃j] [ɐ̃j] mão [ɐ̃w]
cem [ẽj]
anões [õj]
muita [ũj]

In addition, phonetic diphthongs are formed in most Brazilian Portuguese dialects by the vocalization of /l/ in the syllable coda with words like sol [sɔw] ('sun') and sul [suw] ('south') as well as by yodization of vowels preceding /s/ or its allophone at syllable coda [ʃ ~ ɕ] in terms like arroz [aˈʁojs ~ ɐˈʁo(j)ɕ] ('rice'),[47] and /z/ (or [ʒ ~ ʑ]) in terms such as paz mundial [ˈpajz mũdʒiˈaw ~ ˈpa(j)ʑ mũdʑiˈaw] ('world peace') and dez anos [ˌdɛjˈz‿ɐ̃nu(j)s ~ ˌdɛjˈz‿ɐ̃nuɕ] ('ten years').

Spanish

[edit]

Phonetically, Spanish has seven falling diphthongs and eight rising diphthongs. In addition, during fast speech, sequences of vowels in hiatus become diphthongs wherein one becomes non-syllabic (unless they are the same vowel, in which case they fuse together) as in poeta [ˈpo̯eta] ('poet'), almohada [alˈmo̯aða] ('pillow'), maestro [ˈmae̯stɾo] ('teacher') and línea [ˈline̯a] ('line'). The Spanish diphthongs are:[48][49]

Spanish diphthongs
falling
[ai̯] aire 'air' [au̯] pausa 'pause'
[ei̯] rey 'king' [eu̯] neutro 'neutral'
[oi̯] hoy 'today' [ou̯] bou 'seine fishing'
[ui̯] muy 'very'
rising
[ja] hacia 'towards' [wa] cuadro 'picture'
[je] tierra 'earth' [we] fuego 'fire'
[wi] fuimos 'we went'
[jo] radio 'radio' [wo] cuota 'quota'
[ju] viuda 'widow'

Italian

[edit]

The existence of true diphthongs in Italian is debated; however, a list is:[50]

Italian diphthongs
falling
[ai̯] baita 'mountain hut' [au̯] auto 'car'
[ei̯] potei 'I could' (past tense) [eu̯] pleurite 'pleurisy'
[ɛi̯] sei 'six' [ɛu̯] neutro 'neuter'
[ɔi̯] poi 'later'
[oi̯] voi 'you' (pl.)
[ui̯] lui 'he'
rising
[ja] chiave 'key' [wa] guado 'ford'
[jɛ] pieno 'full' [wɛ] quercia 'oak'
[je] soffietto 'bellows' [we] quello 'that'
[wi] guida 'guide'
[jɔ] chiodo 'nail' [wɔ] quota 'quota'
[jo] fiore 'flower' [wo] acquoso 'watery'
[ju] piuma 'feather'

The second table includes only 'false' diphthongs, composed of a semivowel + a vowel, not two vowels. The situation is more nuanced in the first table: a word such as 'baita' is actually pronounced ['baj.ta] and most speakers would syllabify it that way. A word such as 'voi' would instead be pronounced and syllabified as ['vo.i], yet again without a diphthong.

In general, unstressed /i e o u/ in hiatus can turn into glides in more rapid speech (e.g. biennale [bi̯enˈnaːle] 'biennial'; coalizione [ko̯alitˈtsi̯oːne] 'coalition') with the process occurring more readily in syllables further from stress.[51]

Romanian

[edit]

Romanian has two true diphthongs: /e̯a/ and /o̯a/. There are, however, a host of other vowel combinations (more than any other major Romance language) which are classified as vowel glides. As a result of their origin (diphthongization of mid vowels under stress), the two true diphthongs appear only in stressed syllables[52] and make morphological alternations with the mid vowels /e/ and /o/. To native speakers, they sound very similar to /ja/ and /wa/ respectively.[53] There are no perfect minimal pairs to contrast /o̯a/ and /wa/,[10] and because /o̯a/ does not appear in the final syllable of a prosodic word, there are no monosyllabic words with /o̯a/; exceptions might include voal ('veil') and trotuar ('sidewalk'), though Ioana Chițoran argues[54] that these are best treated as containing glide-vowel sequences rather than diphthongs. In addition to these, the semivowels /j/ and /w/ can be combined (either before, after, or both) with most vowels, while this arguably[55] forms additional diphthongs and triphthongs, only /e̯a/ and /o̯a/ can follow an obstruent-liquid cluster such as in broască ('frog') and dreagă ('to mend'),[56] implying that /j/ and /w/ are restricted to the syllable boundary and therefore, strictly speaking, do not form diphthongs.

Irish

[edit]

All Irish diphthongs are falling.

  • [əi̯], spelled aigh, aidh, agh, adh, eagh, eadh, eigh, or eidh
  • [əu̯], spelled abh, amh, eabh, or eamh
  • [iə̯], spelled ia, iai
  • [uə̯], spelled ua, uai

Scottish Gaelic

[edit]

There are 9 diphthongs in Scottish Gaelic. Group 1 occur anywhere (eu is usually [eː] before -m, e.g. Seumas). Group 2 are reflexes that occur before -ll, -m, -nn, -bh, -dh, -gh and -mh.

Spellings Examples
1 [iə] ia iarr "ask"
[uə] ua fuar "cold"
[ia] eu beul "mouth"
2 [ai] ai saill "grease", cainnt "speech", aimhreit "riot"
[ei] ei seinn "sing"
[ɤi] oi, ei, ai loinn "badge", greim "bite", saighdear "soldier"
[ɯi] ui, aoi druim "back", aoibhneas "joy"
[au] a, ea cam "crooked", ceann "head"
[ɔu] o tom "mound", donn "brown"

For more detailed explanations of Gaelic diphthongs see Scottish Gaelic orthography.

Cornish

[edit]

The following diphthongs are used in the Standard Written Form of Cornish. Each diphthong is given with its Revived Middle Cornish (RMC) and Revived Late Cornish (RLC) pronunciation.

Graph RMC RLC Example
aw [aʊ] [æʊ] glaw "rain"
ay [aɪ] [əɪ] bay "kiss"
ew [ɛʊ] blew "hair"
ey [ɛɪ] [əɪ] bleydh "wolf"
iw [iʊ] [ɪʊ] liw "colour"
ow [ɔʊ] lowen "happy"
oy [ɔɪ] moy "more"
uw [yʊ] [ɪʊ] duw "god"
yw [ɪʊ] [ɛʊ] byw "alive"

Welsh

[edit]

Welsh is traditionally divided into Northern and Southern dialects. In the north, some diphthongs may be short or long according to regular vowel length rules but in the south they are always short (see Welsh phonology). Southern dialects tend to simplify diphthongs in speech (e.g. gwaith /ɡwaiθ/ is reduced to /ɡwaːθ/).

Grapheme North South Example
ae /ɑːɨ/ /ai/ maen 'stone'
ai /ai/ gwaith 'work'
au /aɨ/ haul 'sun'
aw /au, ɑːu/ /au/ mawr 'big'
ei /əi/ /əi/ gweithio 'to work'
eu /əɨ/ treulio 'spend'
ey teyrn 'tyrant'
ew /ɛu, eːu/ /ɛu/ tew 'fat'
oe /ɔɨ, ɔːɨ/ /ɔi/ moel 'bald'
ou cyffrous 'excited'
oi /ɔi/ troi 'turn'
ow /ɔu, oːu/ /ɔu/ brown 'brown'
wy /ʊɨ, uːɨ/ /ʊi/ pwyll 'sense'
iw /ɪu/ /ɪu/ lliw 'colour'
uw /ɨu/ duw 'god'
yw llyw 'rudder'
/əu/ /əu/ tywydd 'weather'
† The plural ending -au is reduced to /a/ in the north and /e/ in the south, e.g. cadau 'battles' is /ˈkada/ (north) or /ˈkade/ (south).

Czech

[edit]

There are three diphthongs in Czech:

  • /aʊ̯/ as in auto (almost exclusively in words of foreign origin)
  • /eʊ̯/ as in euro (in words of foreign origin only)
  • /oʊ̯/ as in koule

The vowel groups ia, ie, ii, io, and iu in foreign words are not regarded as diphthongs, they are pronounced with /j/ between the vowels [ɪja, ɪjɛ, ɪjɪ, ɪjo, ɪju].

Serbo-Croatian

[edit]
  • i(j)e, as in mlijeko[57] (in Ijekavian varieties)

may be pronounced as a diphthong, but also as [ie] in hiatus or separated by a semivowel, [ije]. For example, in the first line of the national anthem of Croatia, Lijepa naša domovina, ije is pronounced as a diphthong, but in the first line of the national anthem of Montenegro, Oj, svijetla majska zoro, ije is pronounced as two syllables.

Some Serbo-Croatian dialects also have uo, as in kuonj, ruod, uon[58] whereas, in Standard Croatian and Serbian, these words are konj, rod, on.

Uralic languages

[edit]

Estonian

[edit]

All nine vowels can appear as the first component of an Estonian diphthong, but only e i o u] occur as the second component.

Common Estonian diphthongs
[ɑe] aed
"fence, garden"
[ɑi] lai
"wide"
[ɑo] kaotama
"to lose"
[ɑu] laud
"table"
[eɑ] teadma
"to know"
[ei] leib
"bread"
[eo] teostus
"accomplishment"
[iu] kiuste
"in spite of"
[oɑ] toa
"room"
(s. possessive)
[oe] koer
"dog"
[oi] toit
"food"
[ui] kui
"when, if"
[ɤɑ] lõa
"tether"
(s. possessive)
[ɤe] nõel
"needle"
[ɤi] õige
"right, correct"
[ɤo] tõotus
"promise"
[ɤu] lõug
"chin"
[æe] päev
"day"
[æi] täis
"full"
[æo] näo
"face" (s. possessive)
[æu] näuguma
"to meow"
[øe] söed
"coals"
[øi] köis
"rope"

There are additional diphthongs less commonly used, such as [eu] in Euroopa (Europe), [øɑ] in söandama (to dare), and [æu] in näuguma (to mew).

Finnish

[edit]

All Finnish diphthongs are falling. Notably, Finnish has true opening diphthongs (e.g. /uo/), which are not very common crosslinguistically compared to centering diphthongs (e.g. /uə/ in English). Vowel combinations across syllables may in practice be pronounced as diphthongs, when an intervening consonant has elided, as in näön [næøn] instead of [næ.øn] for the genitive of näkö ('sight').

closing
  • [ɑi̯] as in laiva (ship)
  • [ei̯] as in keinu (swing)
  • [oi̯] as in poika (boy)
  • [æi̯] as in äiti (mother)
  • [øi̯] as in öisin (at nights)
  • [ɑu̯] as in lauha (mild)
  • [eu̯] as in leuto (mild)
  • [ou̯] as in koulu (school)
  • [ey̯] as in leyhyä (to waft)
  • [æy̯] as in täysi (full)
  • [øy̯] as in löytää (to find)
close
  • [ui̯] as in uida (to swim)
  • [yi̯] as in lyijy (lead)
  • [iu̯] as in viulu (violin)
  • [iy̯] as in siistiytyä (to smarten up)
opening
  • [ie̯] as in kieli (tongue)
  • [uo̯] as in suo (bog)
  • [yø̯] as in (night)

Northern Sami

[edit]

The diphthong system in Northern Sami varies considerably from one dialect to another. The Western Finnmark dialects distinguish four different qualities of opening diphthongs:

  • /eæ/ as in leat "to be"
  • /ie/ as in giella "language"
  • /oa/ as in boahtit "to come"
  • /uo/ as in vuodjat "to swim"

In terms of quantity, Northern Sami shows a three-way contrast between long, short and finally stressed diphthongs. The last are distinguished from long and short diphthongs by a markedly long and stressed second component. Diphthong quantity is not indicated in spelling.[15]: 148–149  [59]

Semitic languages

[edit]

Maltese

[edit]

Maltese has seven falling diphthongs, though they may be considered VC sequences phonemically.[60]

  • [ɛɪ̯] ej or għi
  • [ɐɪ̯] aj or għi
  • [ɔɪ̯] oj
  • [ɪʊ̯] iw
  • [ɛʊ̯] ew
  • [ɐʊ̯] aw or għu
  • [ɔʊ̯] ow or għu

Sino-Tibetan languages

[edit]

Mandarin Chinese

[edit]

Rising sequences in Mandarin are usually regarded as a combination of a medial semivowel ([j], [w], or [ɥ]) plus a vowel, while falling sequences are regarded as one diphthong.

  • ai: [ai̯], as in ài ('love')
  • ao: [au̯], as in dào ('way')
  • ei: [ei̯], as in lèi ('tired')
  • ou: [ou̯], as in dòu ('bean')

Cantonese

[edit]

Cantonese has eleven falling diphthongs.

  • aai: [aːi̯], as in gaai1 ('street')
  • aau: [aːu̯], as in baau3 ('explode')
  • ai: [ɐi̯], as in gai1 ('chicken')
  • au: [ɐu̯], as in ngau1 ('hook')
  • ei: [ei̯], as in gei1 ('machine')
  • eu: [ɛːu̯], as in deu6 ('throw')
  • eui: [ɵy̯], as in zeoi3 ('drunk')
  • iu: [iːu̯], as in giu3 ('call')
  • oi: [ɔːy̯], as in oi3 ('love')
  • ou: [ou̯], as in gou1 ('high')
  • ui: [uːy̯], as in pui4 ('accompany')

Tai–Kadai languages

[edit]

Thai

[edit]

In addition to vowel nuclei following or preceding /j/ and /w/, Thai has three diphthongs:[61]

  • [ia̯] เ–ีย ia
  • [ɯa̯] เ–ือ uea
  • [ua̯] –ัว ua

Mon-Khmer languages

[edit]

Vietnamese

[edit]

In addition to vowel nuclei following or preceding /j/ and /w/, Vietnamese has three diphthongs:

  • [iə̯] ia ~
  • [ɨə̯] ưa ~ ươ
  • [uə̯] ua ~

Khmer

[edit]

Khmer language has rich vocalics with an extra distinction of long and short register to the vowels and diphthongs.

  • [iə̯]
  • [ei̯]
  • [ɐe̯]
  • [ɨə̯]
  • [əɨ̯]
  • [ɐə̯]
  • [ao̯]
  • [uə̯]
  • [ou̯]
  • [ɔə̯]
  • [eə̯̆]
  • [uə̯̆]
  • [oə̯̆]

Bantu languages

[edit]

Zulu

[edit]

Zulu has only monophthongs. Y and w are semi-vowels:

  • [ja] as in [ŋijaɓɛːɠa] ngiyabeka (I am placing it)
  • [wa] as in [ŋiːwa] ngiwa (I fall/I am falling)

Austronesian languages

[edit]

Indonesian

[edit]

Indonesian has four diphthongs which may be located at the beginning, middle or end of a word.[62] They are:

  • /ai̯/: balairung ('hall'), kedai ('shop'), pandai ('clever')
  • /au̯/: autodidak ('autodidact'), Taufik (Indonesian given name), kerbau ('buffalo'), limau ('lemon')
  • /oi̯/ (or /ʊi̯/ in Indonesian): boikot ('boycott'), amboi (an expression when amazed)
  • /ei̯/: eigendom ('property'), survei ('survey')

See also

[edit]

References

[edit]

Bibliography

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
A diphthong is a vowel sound in which the tongue (or other articulators) moves from one position to another during its production within a single syllable, resulting in a gliding effect between two distinct vowel qualities, often described as an onglide followed by an offglide. This contrasts with monophthongs, which maintain a relatively steady vowel quality without significant movement. In phonetic terms, diphthongs are characterized by dynamic formant transitions, particularly in the second formant (F2), which reflect the spectral changes as the sound glides. Diphthongs occur in many languages and are integral to their phonological systems, though their inventory and realization vary widely; for instance, English has approximately eight to nine diphthongs depending on the dialect, classified primarily as closing diphthongs (gliding toward a higher vowel, such as /eɪ/ in "day" or /aʊ/ in "now") and centering diphthongs (gliding toward a central vowel like schwa, such as /ɪə/ in "beer" or /eə/ in "bear"). In American English, common examples include /aɪ/ (as in "bite"), /eɪ/ ("bait"), /ɔɪ/ ("boys"), /aʊ/ ("pout"), and /oʊ/ ("boat"), with acoustic studies showing that the onset position is particularly salient for perception and classification. These sounds are often spelled with digraphs or contextual vowel letters, such as "ai" in "fine" or "ou" in "loud," though orthographic representation does not always align perfectly with phonetic reality. The study of diphthongs is crucial in phonetics and phonology, as their formant trajectories provide insights into speech production, perception, and language acquisition; for example, developmental research indicates that children initially produce diphthongs with longer durations in the initial segment, gradually refining the glide. Unlike semivowels (which may resemble diphthongs but function consonantly), true diphthongs are vowel nuclei that contribute to syllable structure without interrupting it.

Definition and Fundamentals

Definition

A diphthong is a complex speech sound or phoneme that begins with one vowel quality and glides continuously into another within the same syllable, typically perceived by speakers as a single unitary vowel despite the internal transition. This gliding movement involves a shift in the position of the tongue or other articulators, producing a dynamic vowel quality that changes over the course of its articulation without interruption. The term originates from the Ancient Greek diphthongos, meaning "two sounds," reflecting its composition of two distinct yet connected vowel elements. Key criteria for identifying a diphthong include its continuous articulation, where the transition between the starting and ending vowel qualities occurs smoothly without a break, and its occupation of a single nucleus position within the syllable structure. This distinguishes diphthongs from sequences of separate vowels across syllable boundaries, such as in hiatus, where two independent vowel nuclei are involved. In phonetic transcription, diphthongs are often represented by two consecutive vowel symbols in the International Phonetic Alphabet, such as [aɪ] or [aʊ], indicating the initial and target positions. In contrast to monophthongs, which are simple vowels with a steady, unchanging quality produced by holding the articulators in a relatively fixed position (e.g., or ), diphthongs are complex vowels characterized by this inherent glide, resulting in a perceptually richer sound. For instance, while a monophthong like maintains uniform tongue height and backness throughout, a diphthong such as [eɪ] starts at a mid-front position and moves toward a higher, more fronted quality. This distinction highlights diphthongs as foundational elements in vowel systems across languages, contributing to phonetic diversity within syllables.

Phonetic Properties

Diphthongs involve a smooth articulatory transition between two vowel positions within the same syllable, characterized by gradual shifts in tongue height and advancement, lip rounding or spreading, and jaw position. For example, during the production of the English diphthong /aɪ/, the tongue begins in a low central position and moves upward and forward to a high front target, while the jaw lowers initially and then rises slightly, with lips remaining largely unrounded throughout. In contrast, diphthongs like /aʊ/ feature an initial open jaw and unrounded lips that progress to a more closed position with increased lip rounding as the tongue retracts toward a back vowel target. These coordinated gestures ensure a continuous, non-abrupt glide, distinguishing diphthongs from vowel sequences separated by a consonantal interruption. Acoustically, diphthongs are marked by time-varying formant frequencies, particularly in the first (F1) and second (F2) formants, which trace nonlinear trajectories reflecting the articulatory glide. F1 typically rises or falls in correspondence with changes in tongue height and jaw opening, while F2 shifts with tongue advancement or retraction, often exhibiting a curved path from the onset vowel's steady-state frequencies to those of the offset. For instance, in /aɪ/, F1 decreases from around 700-800 Hz to 300-400 Hz, and F2 rises from 1200-1400 Hz to 2000-2200 Hz over the segment's duration, creating a perceptual sense of movement. These gliding patterns are more pronounced in clear speech, where formant excursions expand without substantially altering the transition rate. Perceptually, listeners integrate the dynamic spectral changes of diphthongs into a single coherent vowel percept, treating the entire glide as one phonological unit despite its internal variation. This unity arises from auditory processing that emphasizes the overall trajectory rather than discrete components, allowing native speakers to distinguish diphthongs from adjacent vowels in hiatus. Studies show that English speakers consistently perceive diphthongs like /ɔɪ/ as monolithic segments, even when acoustic analysis reveals distinct onglide and offglide phases. Key measurement techniques for diphthong properties include spectrographic analysis, which visualizes formant paths as curved bands on time-frequency plots, and kinematic tracking using tools like electromagnetic articulography to correlate tongue and jaw movements with acoustic output. Software such as PRAAT enables precise extraction of formant values at millisecond intervals, facilitating quantification of trajectory extent and velocity. These methods confirm strong correlations between articulatory displacements and formant shifts, such as negative F1-tongue height relations in rising diphthongs.

Classification

Directional Types

Diphthongs are categorized into directional types primarily based on the trajectory of sonority or prominence across their two vocalic elements, where sonority refers to the relative openness and acoustic intensity of vowels, with more open vowels exhibiting higher sonority than closer ones or glides. Falling diphthongs commence with a vowel of higher sonority, transitioning to a semivowel or closer vowel of lower sonority, thereby decreasing overall sonority; this structure positions the initial element as the syllabic nucleus. For instance, the diphthong /aɪ/ in the English word "eye" begins with the low central vowel /a/ of high sonority and glides toward the near-high front /ɪ/, which serves as a less sonorous off-glide. In contrast, rising diphthongs initiate with a semivowel or closer vowel of lower sonority, culminating in a more open vowel of higher sonority, thus increasing sonority and assigning the nuclear role to the final element. An example appears in /iə/ as realized in certain dialects, starting with a high front glide-like /i/ and proceeding to the mid central /ə/ for greater sonority prominence. The classification criteria emphasize relative pitch height—perceived as fundamental frequency—or vowel prominence within the glide sequence, where the element with greater duration, intensity, or openness dominates as the head, dictating the directional label. Phonetic diagrams typically depict these trajectories on a vowel chart trapezoid, with falling diphthongs shown as curved arrows originating from lower (more open) positions and curving upward toward higher (more closed) frontal or back targets, illustrating the sonority descent; rising diphthongs, meanwhile, feature arrows from higher initial positions gliding downward to more central or open endpoints, highlighting the sonority ascent. This sonority-based directional framework complements but differs from classifications by height and quality, which prioritize vowel openness degrees rather than prominence shifts.

Height and Quality Types

Diphthongs are classified by the nature of height changes and quality variations in their glide, reflecting shifts in tongue position and vowel openness. Closing diphthongs feature a glide toward a higher, more closed vowel, typically ending in a near-high position like /ɪ/ or /ʊ/. In English, examples include /eɪ/ as in "bay," /aɪ/ as in "buy," and /ɔɪ/ as in "boy," which glide toward /ɪ/, and /aʊ/ as in "cow" and /əʊ/ as in "go," which target /ʊ/. These movements increase sonority closure within the syllable. Opening diphthongs, by contrast, end in a lower, more open vowel, involving a descent in tongue height. Such diphthongs are less prevalent in English but appear in languages like Spanish, as in /ja/ pronounced in "hacia" (toward), where the glide moves from high front /j/ (semivowel) to open central /a/. This type often aligns with rising patterns in syllable-initial positions across various phonetic systems. Centering diphthongs involve a glide toward a mid-central schwa-like vowel /ə/, often resulting in an opening quality due to the central target's relative openness. Common in English Received Pronunciation, they include /ɪə/ as in "beer," /eə/ as in "bear," and /ʊə/ as in "poor," where the tongue shifts from peripheral to central position. These are distinct from other types by their convergence to a neutral central vowel. Quality variations further differentiate diphthongs as narrow or wide, based on the magnitude of the articulatory glide. Narrow diphthongs exhibit minimal tongue displacement, such as /eɪ/ in "gate" or /oʊ/ in "go," spanning a smaller portion of the vowel space. Wide diphthongs, conversely, involve substantial movement, like /aɪ/ in "ride" or /aʊ/ in "loud," traversing from open to near-close positions for a more pronounced quality shift. These distinctions highlight the spectrum of perceptual and articulatory complexity in diphthong realization.

Durational Aspects

Diphthongs can be classified as long or short based on their overall duration, which influences syllable weight in many languages. In English, a syllable containing a diphthong is considered heavy or bimoraic, thereby attracting primary stress more readily than light syllables with short monophthongs. This weight sensitivity ensures that words like "house" (with the diphthong /aʊ/) receive stress on the diphthong-bearing syllable, contributing to the language's rhythmic structure. The temporal structure of diphthongs often exhibits uneven timing between their elements, with the initial vowel typically occupying a greater portion of the total duration, particularly in falling diphthongs. Acoustic studies of American English diphthongs reveal that the first demisyllable (initial element) has a normalized duration that varies by diphthong type, generally longer than the glide, which facilitates perceptual identification of the sound's trajectory. In falling types like [aɪ], this asymmetry results in a bipartite pattern where the onset vowel dominates temporally before transitioning to the offglide. Cross-linguistically, diphthong duration may serve as a phonemic contrast or merely an allophonic variation depending on the language. In North Saami, certain diphthongs, such as high ones like [ie] and [uo], exhibit ternary length distinctions (short, long, overlong), where duration differences are phonemically relevant and affect moraic weight, as seen in minimal triplets distinguishing meanings via diphthong length. Conversely, in languages like English, variations in diphthong duration are largely allophonic, influenced by phonetic context such as stress or following consonants, without altering phonemic identity. In languages with phonemic length contrasts, diphthong duration plays a key role in prosody and rhythm by aligning with higher-level boundaries. For instance, in Thai, long diphthongs undergo prosodic lengthening—often doubling in duration at phrase edges—which reinforces tonal and rhythmic patterns, contributing to the language's mora-timed qualities. This lengthening enhances syllable prominence and overall speech timing, distinguishing prosodic units in connected discourse.

Phonological Aspects

Role in Syllable Structure

In phonological theory, diphthongs function as the nucleus of a syllable, occupying the peak position where sonority is highest, and are treated as a single unit rather than two separate vowels. This unitary status allows diphthongs to serve as the core of the syllable without necessitating a division into multiple vocalic elements, as seen in the English word "time" pronounced as [taɪm], where [aɪ] forms the entire nucleus. Diphthongs interact with surrounding consonants within canonical CV (consonant-vowel) syllable structures, often imposing restrictions on onset and coda placement to maintain sonority sequencing. For instance, in many languages, a diphthong in the nucleus can be followed by a coda consonant only if it does not violate rising sonority principles, as in English [haʊs] "house," where the coda follows the falling diphthong [aʊ]. These interactions help define permissible syllable margins, preventing certain consonant clusters that would otherwise arise if the diphthong were analyzed as a vowel sequence. Syllabification rules frequently incorporate diphthongs into complex nuclei, influencing processes like ambisyllabicity, where a consonant may affiliate with both preceding and following syllables. In words like English "fire" [faɪər], the diphthong [aɪə] forms a complex nucleus that can lead to resyllabification ambiguities, such as whether belongs to the coda or the next onset, depending on morphological boundaries. This complexity arises because diphthongs expand the nucleus without adding syllables, adhering to universal tendencies for sonority peaks within a single timing slot. The presence of diphthongs in the nucleus has significant implications for stress and intonation patterns, as they often attract primary stress due to their inherent prominence and duration. In languages like Dutch, diphthongs act as strong stress attractors, forming superheavy syllables that prioritize them for accentuation over monophthongs, as in stressed [aɪ] versus unstressed variants. For intonation, the gliding quality of diphthongs allows pitch contours to unfold dynamically across the nucleus, contributing to prosodic phrasing and emphasis in connected speech.

Phonemic Contrasts

In phonology, diphthongs achieve phonemic status when they form part of a language's contrastive inventory, distinguishing them from monophthongs through meaningful oppositions that alter word identity. For example, in English, /eɪ/ contrasts with /ɛ/, as in "bait" versus "bet," where the off-glide in the diphthong creates a distinct phoneme capable of differentiating lexical items. This contrastive function is supported by evidence from speech production and processing experiments, where speakers consistently treat major diphthongs as unitary segments rather than sequences of two vowels, reinforcing their role as single phonemes. Allophonic variations, by contrast, involve diphthong-like glides that emerge predictably in phonetic contexts without serving a contrastive purpose, such as off-glides following tense vowels or in hiatus resolution. These non-contrastive glides do not belong to the phonemic inventory and are instead realizations of underlying monophthongs, varying systematically based on surrounding segments or prosodic position. The cited experiments illustrate this for diphthongs, with some treated as single units influenced by context. The phonemic contrasts involving diphthongs are vividly illustrated in minimal pairs, where substitution of a diphthong for a monophthong—or vice versa—results in words with entirely different meanings, underscoring the glide's role in semantic differentiation. Such pairs highlight how the dynamic quality of the diphthong provides a perceptual cue for phonemic opposition, distinct from static monophthongal vowels. In theoretical terms, frameworks like feature geometry model diphthongs as complex segments associated with a single timing slot, which captures both their internal articulatory trajectory and unitary phonological behavior. This representation aligns with their frequent function as syllable nuclei, enabling cohesive prosodic integration.

Historical Development

Sound Changes Forming Diphthongs

Diphthongization refers to the historical sound change whereby a monophthong, or single vowel sound, develops into a diphthong through gliding or breaking, often triggered by assimilation to adjacent consonants or prosodic factors. This process is commonly observed in Germanic languages, where short front vowels diphthongize before certain back or liquid consonants; for instance, in Old English, the vowels /i/, /e/, and /æ/ broke into /iu/, /eo/, and /æɑ/ respectively before /h/, /r/, or /l/ followed by another consonant, as in eald 'old' from earlier ald. Similarly, in Middle Korean, the monophthong /a/ in taj 'great' shifted to the diphthong /əj/ in the modern language, illustrating a gradual off-gliding effect. Iotation, the palatalization induced by a following /j/ or high front vowel, frequently contributes to diphthong formation by inserting or fusing a palatal glide into a preceding vowel. In Proto-Germanic, sequences like *eu before *i or j resulted in *iu, which further evolved into diphthongs in daughter languages; for example, PIE *leuk- 'light' became Proto-Germanic *leuχ, then Old English *lēōht with a diphthongal reflex. Labialization, the analogous rounding influence from /w/ or back glides, similarly creates labial diphthongs, as seen in the development of rounded diphthongs from vowel + /w/ sequences in various Indo-European branches. These changes often arise from the assimilation of glides to vowels in hiatus, transforming vowel sequences into complex nuclei. Reconstructions of proto-languages reveal diphthongs as primary phonemes in Proto-Indo-European (PIE), such as *ei, *oi, *ai, *eu, *ou, *au, which often involved resonant elements and evolved variably in descendant languages through syllable restructuring, as evidenced in ablaut patterns where /ghew-/ 'pour' yields Sanskrit juhóti and Greek χέω with resonant integration. PIE short diphthongs like *ai and *au merged and monophthongized in some branches but persisted or reformed in others, such as Proto-Germanic *aiz remaining as a diphthong before lengthening to ā in Old English ār 'brass, bronze', cognate to Latin aes. Additionally, laryngeal consonants in PIE could color adjacent vowels, contributing to diphthong-like formations in daughter languages. Factors such as stress placement and dialectal variation accelerate diphthong formation by promoting breaking in stressed syllables or unevenly across regions. Stressed vowels are more prone to diphthongization due to heightened articulatory tension, as in the Old English breaking process limited to stressed positions, while dialectal differences can preserve or innovate diphthongs variably, contributing to diversity in outcomes like the partial monophthongization of *ai in some Germanic dialects versus retention in others.

Diphthong Shifts and Monophthongization

Monophthongization refers to the historical sound change in which a diphthong reduces to a single vowel by loss or weakening of the offglide, often resulting in simplification of the vowel system. This process contrasts with diphthong formation and frequently occurs as part of broader vowel shifts, leading to mergers that diminish phonemic distinctions. In many languages, it contributes to reducing the complexity of syllable nuclei over time. A prominent example appears in the transition from Old English to Middle English, where all diphthongs smoothed into monophthongs by the early Middle English period. For instance, the Old English long diphthongs /ēa/ (as in <dēop>, "deep") and /ēo/ (as in <dēor>, "animal") monophthongized to /ēː/ and /ōː/ respectively in Late West Saxon by the 11th century, with /ēa/ sometimes yielding /ǣː/ in certain dialects. This leveling eliminated the original four short and four long diphthongs (/æɑ, eɑ, eɔ, iɑ/ and their long counterparts), merging them into a simpler set of monophthongs and paving the way for new diphthongs to emerge later in Middle English. Diphthongs also participate in chain shifts, where sequential adjustments in vowel quality maintain contrasts across the system. In the Great Vowel Shift (c. 1400–1700), the diphthongization of Middle English high monophthongs /iː/ and /uː/ to /aɪ/ and /aʊ/ created new diphthongs that interacted with raising of lower vowels, forming a push-chain that preserved phonemic inventory while altering realizations (e.g., /aɪ/ from /iː/ in "bite," /aʊ/ from /uː/ in "house"). Later, some of these diphthongs underwent partial monophthongization in Early Modern English; for example, Middle English /aʊ/ (as in "lawe," "law") simplified to /ɔ/, /ɔʊ/ (as in "snow") to /o/, and /ɛɪ/ (as in "hail") to /e/. Such changes often occur under specific conditions, including unstressed positions where the glide weakens due to reduced articulatory effort, or through assimilation where the offglide blends with adjacent consonants, promoting glide deletion. For instance, in chain shifts, falling nuclei of upgliding diphthongs (e.g., /aɪ/ lowering its onset) can trigger compensatory adjustments in related vowels to avoid mergers. Outcomes typically include phonemic mergers, as seen in English where multiple diphthongs converged on shared monophthongs, thereby contracting the vowel inventory and eliminating contrasts (e.g., former OE /eɑ/ and /æɑ/ both yielding /æː/ in some Middle English varieties).

Versus Semivowels

Semivowels, also known as glides, are consonant-like sounds such as /j/ (as in English "yes") and /w/ (as in English "wet"), which function as consonantal glides in syllable margins, whereas diphthongs are gliding vowel sounds that occur within the syllable nucleus, combining a primary vowel with a secondary vocalic glide to form a single complex vowel unit. Distributional evidence distinguishes semivowels from diphthongs, as semivowels typically appear in onset or coda positions alongside consonants—for instance, /j/ in syllable-initial clusters like /pj/ in "pure" or /w/ in coda like /kw/ in "quick"—while diphthongs are restricted to the nuclear position and cannot occupy marginal roles independently. This positional restriction is evident in phonological rules, such as vowel epenthesis or glide insertion, which treat semivowels as consonants eligible for onset or coda slots but bar diphthongs from such environments. Phonetically, semivowels and the gliding components of diphthongs share similar articulations, with both involving rapid transitions from or to a high vowel position (e.g., -like for /j/ or -like for /w/), but they differ in sonority profiles: diphthongs exhibit a primary sonority peak at the nuclear vowel followed by a secondary peak or fall associated with the glide, maintaining overall vocalic prominence, whereas semivowels display lower sonority relative to adjacent vowels, behaving more like consonants in the syllable contour. This sonority distinction underscores the vocalic nature of diphthongs, where the glide integrates into the nucleus without forming a separate sonority minimum typical of consonantal glides. In generative phonology, theoretical debates center on the underlying relation between glides and vowels, with classic accounts like The Sound Pattern of English treating semivowels as non-syllabic vowels specified as [-cons, -syllabic], allowing rules to derive their consonantal behavior from positional context, while alternative views posit them as underlying consonants to account for their marginal distributions and resistance to vocalization in certain rules. These perspectives highlight ongoing discussions about whether glide-vowel alternations (e.g., /i/ surfacing as in onsets) reflect abstract feature sharing or language-specific derivations.

Versus Vowel Sequences

A diphthong consists of a glide-vowel or vowel-glide sequence within a single syllable, resulting in a smooth articulatory and acoustic transition between the two vocalic elements. In contrast, a hiatus involves two distinct vowels belonging to separate syllables, often separated by a glottal stop or other consonantal articulation to maintain syllabic boundaries. This distinction is crucial in languages like Spanish, where sequences such as [a.i] in caer ('to fall') are realized as hiatus ([kaˈer]), while [e.j] in reina ('queen') forms a diphthong ([ˈrej.na]). Phonological tests for distinguishing these structures include resyllabification across word boundaries. In hiatus, the second vowel can incorporate a preceding consonant as its onset in connected speech, as seen in Spanish phrases like lo importante ([lo.im.poɾˈtan.te]), where the /m/ resyllabifies to the following syllable. Diphthongs, being tautosyllabic, do not permit such resyllabification, preserving the glide within the original syllable nucleus. Another test involves the insertion of epenthetic consonants, such as glottal stops, which typically occur between vowels in hiatus to enforce syllabic separation—for instance, in American English word-boundary sequences like "see it" ([siʔɪt]), but not within true diphthongs like [aɪ] in eye. Acoustically, hiatus exhibits abrupt formant transitions, longer overall duration, and greater spectral curvature in the second formant (F2) trajectory, reflecting two independent vowel targets. Diphthongs, by comparison, show smoother, more continuous formant movements with reduced static phases and steeper spectral changes, indicative of a unified syllabic gesture. These cues are modulated by speech rate: faster speech tends to blur the boundary, potentially reducing hiatus to diphthong-like realizations. Cross-linguistically, there is a strong tendency for hiatus involving high vowels (e.g., [iV] or [uV]) to resolve via glide insertion, transforming the sequence into a diphthong for articulatory efficiency, as observed in the evolution of Romance languages from Latin. In Romanian and Spanish, this process is partial, maintaining some contrast, while in French it is complete, with all such sequences diphthongized; in Portuguese, however, iV sequences are realized as hiatus. Semivowels play a key role in this resolution, often emerging from the high vowel to bridge the hiatus.

Transcription and Notation

IPA Representations

In the International Phonetic Alphabet (IPA), diphthongs are transcribed as sequences of two vowel symbols, representing the starting and ending vowel qualities within a single syllable nucleus, such as [aɪ] for a low central to high front glide or [aʊ] for a low central to high back glide. This notation captures the gliding transition without implying equal duration for each element, treating the diphthong as a unitary sound rather than a strict concatenation of monophthongs. To indicate that the two vowel symbols form an indivisible unit, a tie bar (͡ or ‿) may be placed above or below them, as in [a͡ɪ] or [e͡ɪ], particularly when distinguishing the diphthong from a potential vowel hiatus or sequence. This convention is more commonly applied to affricates and co-articulated consonants, and while optional, tie bars are rarely used for diphthongs in standard IPA practice. Offglides, the secondary target of the glide, can be denoted using diacritics on the primary vowel for brevity, such as [aʲ] for a palatal offglide (equivalent to [ai̯]) or [aʷ] for a labial-velar offglide (equivalent to [au̯]); the non-syllabic marker ̯ is alternatively used after the second symbol to specify a glide, as in [ai̯]. Pre-IPA systems, such as that developed by British phonetician Daniel Jones, employed simplified notations without specialized symbols, representing common diphthongs as digraphs like ai (for [aɪ]), au (for [aʊ]), or ou (for [oʊ]) in broad English transcriptions. These alternatives prioritized accessibility for language teaching over the full articulatory detail of IPA. Regional preferences among phoneticians influence diphthong transcription: British practitioners often favor centralized or lowered diacritics for offglides in Received Pronunciation (e.g., [aɪ̇] with centralization), while American phoneticians may prefer broader notations without ties, emphasizing rhotic influences in General American (e.g., [aɚ] for r-colored variants). Length in diphthongs is transcribed by applying the length marker [ː] to the dominant initial element, as in [aːɪ] for a prolonged starting point, or using half-length [ˑ] for intermediate durations like [aˑɪ]; extra-short glides may employ the breve [̆] on the offglide, such as [aĭ]. Stress is indicated suprasegmentally before the syllable containing the diphthong, with primary stress as [ˈaɪ] and secondary as [ˌaɪ], ensuring the mark applies to the entire gliding unit rather than individual components. These rules maintain consistency across languages while accommodating phonetic nuances.

Orthographic Variations

In alphabetic writing systems, diphthongs are frequently represented through digraphs—pairs of letters denoting a single gliding sound—or ligatures, which fuse characters for efficiency or historical reasons. For instance, in English, the diphthong /aʊ/ is orthographically rendered as the digraph "au" in words like "out" or "ow" in "cow," reflecting Middle English developments where such combinations emerged to distinguish vowel glides from monophthongs. Similarly, /eɪ/ appears as "ei" in "vein" or "ey" in "they," adaptations that arose during the Great Vowel Shift when earlier /aɪ/ evolved, leading to inconsistent but standardized spellings by the early modern period. Ligatures like æ (ash), originating in Old English manuscripts, historically denoted diphthongs such as [æə] or mergers from Proto-Germanic [ai], as seen in Wessex dialect texts where it contrasted with simple or . The œ ligature, a fusion of and , similarly captured [oe̯] diphthongs in Latin-influenced scripts, persisting in early English borrowings until the 19th century when typewriting favored separated "oe." Language-specific orthographies often exhibit inconsistencies in diphthong spelling due to historical sound changes and regional variations. In French, the digraph "oi" consistently represents the diphthong /wa/, as in "voix" (voice), a remnant of Old French [oi] that simplified over time but retained its orthographic form. The combination "eu" denotes monophthongs like /ø/ in "peu" (little) or /œ/ in words like "peur" (fear), where it evolved from Latin /eu/ via Old French mergers, creating spelling irregularities compared to monophthongs like /ɛ/. These variations stem from etymological influences, such as Latin and Germanic loans, leading to non-phonetic mappings where context determines pronunciation. Orthographic reforms and standardizations have significantly shaped diphthong notation in modern languages by prioritizing consistency over phonetic accuracy. In English, the late 15th-century introduction of printing presses, centered in London, promoted Chancery English norms that fixed digraphs like "ou" for /aʊ/ in "house," despite the Great Vowel Shift altering sounds and creating mismatches with earlier spellings. This standardization reduced dialectal variants but preserved historical forms, as seen in the replacement of runic ligatures like æ with digraphs in post-Norman texts. In non-alphabetic traditions, such as Chinese, the Pinyin romanization system—standardized in 1958—employs digraphs to approximate diphthongs for logographic characters, with "ai" for /ai/ (e.g., "ài" love), "ei" for /ei/ (e.g., "bèi" back), "ao" for /au/ (e.g., "dào" way), and "ou" for /ou/ (e.g., "dòu" bean), bridging syllabic tones and vowel glides without altering the underlying hanzi script.

Examples Across Languages

Indo-European Languages

In Proto-Indo-European, diphthongs such as *ei were reconstructed as vowel-resonant sequences that evolved variably across descendant languages, often monophthongizing or shifting in quality depending on the branch; for instance, *ei developed into long ē in Latin but remained a diphthong longer in early Greek forms before further changes. In English Received Pronunciation, the diphthong inventory includes closing diphthongs like /eɪ/ (as in "bate" [beɪt]), /aʊ/ (as in "how" [haʊ]), and /ɔɪ/ (as in "boy" [bɔɪ]), where the vowel glides from a more open to a closer position within a single syllable. Dialectal variations, such as in some non-RP accents, feature centering diphthongs like /aɪə/ (as in "fire" [faɪə]), where the glide moves toward a schwa-like central vowel. Standard German maintains three primary closing diphthongs as phonemic units: /aɪ/ (as in "Stein" [ʃtaɪn] 'stone'), /aʊ/ (as in "Haus" [haʊs] 'house'), and /ɔʏ/ (as in "Feuer" [fɔʏɐ] 'fire'), which contrast meaningfully in minimal pairs and distinguish lexical items from monophthongal vowels. Historical forms of French featured nasalized diphthongs, such as [ẽɪ] arising from Old French sequences like [ai] or [ei] before nasals (e.g., in 16th-century "haine"), reflecting a stage of regressive nasalization and vowel coalescence before monophthongization in modern varieties. Spanish treats rising diphthongs like /je/ (as in "piel" [pjel] 'skin') and /we/ (as in "muela" [mweɫa] 'molar') as single syllabic units, where a high vowel glide ( or ) precedes a stronger mid or low vowel, forming a complex nucleus without hiatus. In Ancient Greek, diphthongs such as αι (/ai/, pronounced like "aisle") and ει (/ei/, like "eight") were common proper diphthongs combining two vowels in one syllable, often marked by aspiration or length in classical texts. These evolved in Modern Greek through monophthongization, with αι shifting to /e/ (as in "και" [ce] 'and') by the Hellenistic period and ει to /i/ (as in "είναι" [ˈini] 'to be') as early as the 1st century BCE, evidenced in papyri and inscriptions showing the loss of the secondary vowel element.

Non-Indo-European Languages

In non-Indo-European languages, diphthongs exhibit significant typological variation, often interacting with features like tone, nasalization, and syllable structure unique to language families such as Sino-Tibetan, Austroasiatic, Niger-Congo, Uralic, Kra-Dai, and Austronesian. These diphthongs frequently appear as rising or centering forms in tonal and isolating languages, where pitch contours can influence their realization and perceptual boundaries. For instance, rising diphthongs are prevalent in tone languages, as their gliding trajectory aligns with rising tones, enhancing contrastive distinctions without requiring additional segmental complexity. Mandarin Chinese, a Sino-Tibetan language, features rare true diphthongs in syllable finals, such as /aɪ/ (as in ài 'love'), which are phonologically often analyzed as sequences of a vowel and a glide rather than unitary phonemes. This distinction arises because finals like ai and ei behave as V-G (vowel-glide) combinations in phonological rules, though they are articulated with smooth transitions resembling diphthongs in connected speech. Triphthongs like /aɪ̯ɔ/ in yǎo further illustrate this pattern, but true diphthongal status is debated due to the language's strict syllable constraints. Vietnamese, an Austroasiatic language, contrasts phonemic centering diphthongs such as /iə/ (as in kia 'when'), /ɨə/ (as in nghỉa 'meaning'), and /uə/ (as in mưa 'rain'), which function as distinct phonemes within its eleven-vowel system. These falling diphthongs are integral to the language's rhyme structure, appearing in open syllables and interacting with its six tones to create minimal pairs, such as bịa [ɓiə˧˦] 'nonsense' versus bia [ɓiə˦˥] 'beer'. Phonetic studies confirm their status through formant transitions, distinguishing them from monophthongal vowels. In Zulu, a Niger-Congo Bantu language, diphthongs are generally absent from the native phonemic inventory, with potential vowel sequences treated as hiatus; nasalization spreads from prenasalized consonants but does not create diphthong-like transitions in vowels, aligning with broader Bantu tendencies where vowel sequences avoid true diphthongs but incorporate nasal features for morphological marking. Finnish, a Uralic language, employs closing diphthongs like /ai/ (as in maito 'milk'), /oi/ (as in koira 'dog'), and /ui/ (as in luin 'wave'), which are phonemic and occur primarily in non-initial syllables to satisfy vowel harmony and quantity requirements. These diphthongs, numbering eighteen in total, contrast with long monophthongs and undergo no allophonic variation, maintaining distinct formant trajectories in heavy syllables. Their distribution highlights Finnish's agglutinative structure, where diphthongs form in suffixation without altering stress patterns. Thai, a Kra-Dai language, includes both falling and rising diphthongs such as /ai/ (as in mai 'new' or 'not'), /au/ (as in máu 'silk'), and /ia/ (as in sǐa 'four'), whose perception is modulated by the language's five tones. Tones alter diphthong duration and gliding speed—rising tones like /˦˥/ elongate the offset in /ai/, creating perceptual shifts that distinguish homographs, as in mái [máj˦˥] 'silk' versus màj [máj˩˩] 'burn'. Acoustic analyses reveal that short diphthongs like /ai/ versus long /aːj/ further interact with tone contours for lexical contrast. Among Austronesian languages, Hawaiian exemplifies simple glides forming diphthongs like /ai/ (as in kaikua 'child') and /au/ (as in mauna 'mountain'), which are phonemically heavy syllables bearing primary stress on the initial vowel. These nine short diphthongs contrast with long monophthongs in open syllables, with no glottal stops interrupting the glide, as confirmed by formant coarticulation patterns. Japanese, a Japonic isolate, features near-diphthongs like /ai/ (as in ai 'love'), phonologically treated as vowel sequences /a.i/ but realized with diphthongal offgliding [aɪ̯] in casual speech due to moraic timing. This approximation avoids true phonemic diphthongs, aligning with the language's CV syllable structure.

References

Add your contribution
Related Hubs
User Avatar
No comments yet.