Hubbry Logo
search
logo

Regular and irregular verbs

logo
Community Hub0 Subscribers
Read side by side
from Wikipedia

A regular verb is any verb whose conjugation follows the typical pattern, or one of the typical patterns, of the language to which it belongs. A verb whose conjugation follows a different pattern is called an irregular verb. This is one instance of the distinction between regular and irregular inflection, which can also apply to other word classes, such as nouns and adjectives.

In English, for example, verbs such as play, enter, and like are regular since they form their inflected parts by adding the typical endings -s, -ing and -ed to give forms such as plays, entering, and liked. On the other hand, verbs such as drink, hit and have are irregular since some of their parts are not made according to the typical pattern: drank and drunk (not "drinked"); hit (as past tense and past participle, not "hitted") and has and had (not "haves" and "haved").

The classification of verbs as regular or irregular is to some extent a subjective matter. If some conjugational paradigm in a language is followed by a limited number of verbs, or if it requires the specification of more than one principal part (as with the German strong verbs), views may differ as to whether the verbs in question should be considered irregular. Most inflectional irregularities arise as a result of series of fairly uniform historical changes so forms that appear to be irregular from a synchronic (contemporary) point of view may be seen as following more regular patterns when the verbs are analyzed from a diachronic (historical linguistic) viewpoint.

Development

[edit]

When a language develops some type of inflection, such as verb conjugation, it normally produces certain typical (regular) patterns by which words in the given class come to make their inflected forms. The language may develop a number of different regular patterns, either as a result of conditional sound changes which cause differentiation within a single pattern, or through patterns with different derivations coming to be used for the same purpose. An example of the latter is provided by the strong and weak verbs of the Germanic languages; the strong verbs inherited their method of making past forms (vowel ablaut) from Proto-Indo-European, while for the weak verbs a different method (addition of dental suffixes) developed.

Irregularities in verb conjugation (and other inflectional irregularities) may arise in various ways. Sometimes the result of multiple conditional and selective historical sound changes is to leave certain words following a practically unpredictable pattern. This has happened with the strong verbs (and some groups of weak verbs) in English; patterns such as sing–sang–sung and stand–stood–stood, although they derive from what were more or less regular patterns in older languages, are now peculiar to a single verb or small group of verbs in each case, and are viewed as irregular.

Irregularities may also arise from suppletion – forms of one verb may be taken over and used as forms of another. This has happened in the case of the English word went, which was originally the past tense of wend, but has come to be used instead as the past tense of go. The verb be also has a number of suppletive forms (be, is, was, etc., with various different origins) – this is common for copular verbs in Indo-European languages.

The regularity and irregularity of verbs is affected by changes taking place by way of analogy – there is often a tendency for verbs to switch to a different, usually more regular, pattern under the influence of other verbs. This is less likely when the existing forms are very familiar through common use – hence among the most common verbs in a language (like be, have, go, etc.) there is often a greater incidence of irregularity. (Analogy can occasionally work the other way, too – some irregular English verb forms such as shown, caught and spat have arisen through the influence of existing strong or irregular verbs.)[citation needed]

Types of pattern

[edit]

The most straightforward type of regular verb conjugation pattern involves a single class of verbs, a single principal part (the root or one particular conjugated form), and a set of exact rules which produce, from that principal part, each of the remaining forms in the verb's paradigm. This is generally considered to be the situation with regular English verbs – from the one principal part, namely the plain form of a regular verb (the bare infinitive, such as play, happen, skim, interchange, etc.), all the other inflected forms (which in English are not numerous; they consist of the third person singular present tense, the past tense and past participle, and the present participle/gerund form) can be derived by way of consistent rules. These rules involve the addition of inflectional endings (-s, -[e]d, -ing), together with certain morphophonological rules about how those endings are pronounced, and certain rules of spelling (such as the doubling of certain consonants). Verbs which in any way deviate from these rules (there are around 200 such verbs in the language) are classed as irregular.

A language may have more than one regular conjugation pattern. French verbs, for example, follow different patterns depending on whether their infinitive ends in -er, -ir or -re (complicated slightly by certain rules of spelling). A verb which does not follow the expected pattern based on the form of its infinitive is considered irregular.

In some languages, however, verbs may be considered regular even if the specification of one of their forms is not sufficient to predict all of the rest; they have more than one principal part. In Latin, for example, verbs are considered to have four principal parts (see Latin conjugation for details). Specification of all of these four forms for a given verb is sufficient to predict all of the other forms of that verb – except in a few cases, when the verb is irregular.

To some extent it may be a matter of convention or subjective preference to state whether a verb is regular or irregular. In English, for example, if a verb is allowed to have three principal parts specified (the bare infinitive, past tense and past participle), then the number of irregular verbs will be drastically reduced (this is not the conventional approach, however). The situation is similar with the strong verbs in German (these may or may not be described as irregular). In French, what are traditionally called the "regular -re verbs" (those that conjugate like vendre) are not in fact particularly numerous, and may alternatively be considered to be just another group of similarly behaving irregular verbs. The most unambiguously irregular verbs are often very commonly used verbs such as the copular verb be in English and its equivalents in other languages, which frequently have a variety of suppletive forms and thus follow an exceptionally unpredictable pattern of conjugation.

Irregularity in spelling only

[edit]

It is possible for a verb to be regular in pronunciation, but irregular in spelling. Examples of this are the English verbs lay and pay. In terms of pronunciation, these make their past forms in the regular way, by adding the /d/ sound. However their spelling deviates from the regular pattern: they are not spelt "layed" and "payed" (although the latter form is used in some e.g. nautical contexts as "the sailor payed out the anchor chain"), but laid and paid. This contrasts with fully regular verbs such as sway and stay, which have the regularly spelt past forms swayed and stayed. The English present participle is never irregular in pronunciation, with the exception that singeing irregularly retains the e to distinguish it from singing.

Linguistic study

[edit]

In linguistic analysis, the concept of regular and irregular verbs (and other types of regular and irregular inflection) commonly arises in psycholinguistics, and in particular in work related to language acquisition. In studies of first language acquisition (where the aim is to establish how the human brain processes its native language), one debate among 20th-century linguists revolved around whether small children learn all verb forms as separate pieces of vocabulary or whether they deduce forms by the application of rules.[1] Since a child can hear a regular verb for the first time and immediately reuse it correctly in a different conjugated form which he or she has never heard, it is clear that the brain does work with rules; but irregular verbs must be processed differently. A common error for small children is to conjugate irregular verbs as though they were regular, which is taken as evidence that we learn and process our native language partly by the application of rules, rather than, as some earlier scholarship had postulated, solely by learning the forms. In fact, children often use the most common irregular verbs correctly in their earliest utterances but then switch to incorrect regular forms for a time when they begin to operate systematically. That allows a fairly precise analysis of the phases of this aspect of first language acquisition.

Regular and irregular verbs are also of significance in second language acquisition, and in particular in language teaching and formal learning, where rules such as verb paradigms are defined, and exceptions (such as irregular verbs) need to be listed and learned explicitly. The importance of irregular verbs is enhanced by the fact that they often include the most commonly used verbs in the language (including verbs such as be and have in English, their equivalents être and avoir in French, sein and haben in German, etc.).

In historical linguistics the concept of irregular verbs is not so commonly referenced. Since most irregularities can be explained by processes of historical language development, these verbs are only irregular when viewed synchronically; they often appear regular when seen in their historical context. In the study of Germanic verbs, for example, historical linguists generally distinguish between strong and weak verbs, rather than irregular and regular (although occasional irregularities still arise even in this approach).

When languages are being compared informally, one of the few quantitative statistics which are sometimes cited is the number of irregular verbs. These counts are not particularly accurate for a wide variety of reasons, and academic linguists are reluctant to cite them. But it does seem that some languages have a greater tolerance for paradigm irregularity than others.

By language

[edit]

English

[edit]

With the exception of the highly irregular verb be, an English verb can have up to five forms: its plain form (or bare infinitive), a third person singular present tense, a past tense (or preterite), a past participle, and the -ing form that serves as both a present participle and gerund.

The rules for the formation of the inflected parts of regular verbs are given in detail in the article on English verbs. In summary they are as follows:

  • The third person singular present tense is formed by adding the ending -s (or -es after certain letters) to the plain form. When the plain form ends with the letter -y following a consonant, this becomes -ies. The ending is pronounced /s/ after a voiceless consonant sound (as in hops, halts, packs, bluffs, laughs), or /z/ after a voiced consonant or vowel sound (as in robs, lends, begs, sings, thaws, flies, sighs), but /ɪz/ after a sibilant (passes, pushes, marches).
  • The past tense and past participle are identical; they are formed with the ending -ed (or -t in some verbs), which as in the previous case has three different pronunciations (/t/, /d/, /ɪd/). Certain spelling rules apply, including the doubling of consonants before the ending in forms like conned and preferred. There is some variation in the application of these spelling rules with some rarer verbs, and particularly with verbs ending -c (panic–panicked, zinc–zinc(k)ed, arc–arc(k)ed, etc.), meaning that these forms are not fully predictable, but such verbs are not normally listed among the irregular ones. (The verbs lay and pay, however, are commonly listed as irregular, despite being regular in pronunciation – their past forms have the anomalous spellings laid and paid.)
  • The present participle/gerund is formed by adding -ing, again with the application of certain spelling rules similar to those that apply with -ed.

The irregular verbs of English are described and listed in the article English irregular verbs (for a more extensive list, see List of English irregular verbs). In the case of these:

  • The third person singular present tense is formed regularly, except in the case of the modal verbs (can, shall, etc.) which do not add -s, the verb be (which has three present indicative forms: am, is and are), and the three verbs have, do and say, which produce the forms has, does (pronounced with a short vowel, /dʌz/), and says (pronounced with a short vowel, /sɛz/).[2]
  • The past tense and past participle forms are the forms most commonly made in irregular fashion. About 200 verbs in normal use have irregularities in one or other (or usually both) of these forms. They may derive from Germanic strong verbs, as with sing–sang–sung or rise–rose–risen, or from weak verbs which have come to deviate from the standard pattern in some way (teach–taught–taught, keep–kept–kept, build–built–built, etc.). (The past participle often ends in "n", " d" or "ed".) The past and past participle forms change in spelling sometimes.
  • The present participle/gerund is formed regularly, in -ing (except for those defective verbs, such as the modals, which lack such a form).

Common irregular verbs

[edit]

Some examples of common irregular verbs in English, other than modals, are:[3]

  • arise
  • be
  • come
  • do
  • eat
  • fall
  • get
  • give
  • go
  • have
  • hear
  • know
  • lend
  • make
  • run
  • say
  • see
  • take
  • think
  • wear
  • drink
  • put
  • cut
  • catch
  • drive

Other languages

[edit]

For regular and irregular verbs in other languages, see the articles on the grammars of those languages. Particular articles include, for example:

Some grammatical information relating to specific verbs in various languages can also be found in Wiktionary.

Constructed languages

[edit]

Most natural languages, to different extents, have a number of irregular verbs. Artificial auxiliary languages usually have a single regular pattern for all verbs (as well as other parts of speech) as a matter of design, because inflectional irregularities are considered to increase the difficulty of learning and using a language. Other constructed languages, however, need not show such regularity, especially if they are designed to look similar to natural ones.

The auxiliary language Interlingua has some irregular verbs, principally esser "to be", which has an irregular present tense form es "is" (instead of expected esse), an optional plural son "are", an optional irregular past tense era "was/were" (alongside regular esseva), and a unique subjunctive form sia (which can also function as an imperative). Other common verbs also have irregular present tense forms, namely vader "to go" — va, ir "to go" — va (also shared by the present tense of vader), and haber "to have" — ha.

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
In linguistics, regular verbs follow predictable inflection patterns for forms like the past tense, while irregular verbs deviate with unique changes that must often be memorized. In English grammar, this distinction is based on how verbs form their past tense and past participle.[1] Regular verbs follow a predictable morphological rule by appending the suffix -ed (or -d after vowels) to their base stem, allowing for consistent inflection across thousands of verbs, including neologisms like spam-spammed.[1] In contrast, irregular verbs—numbering around 180 in modern English—deviate from this pattern, employing unique stem changes, vowel alternations (ablaut), or other non-standard forms that must be memorized as lexical entries, such as go-went-gone or sing-sang-sung.[1][2] This distinction arises from the interplay between rule-based computation and associative memory in human language processing.[1] Regular forms are generated productively via a symbolic rule in the procedural memory system, enabling generalization to novel words and explaining why children initially overapply -ed to irregulars (e.g., holded for held).[1] Irregular forms, however, are retrieved holistically from declarative memory, often showing frequency effects and resistance to regularization, as evidenced in neuropsychological studies where brain damage selectively impairs irregular production.[1] Historically, many English irregular verbs trace back to strong (ablaut-based) conjugations in Proto-Indo-European and Old English, while the regular -ed pattern emerged as a weak conjugation that has since dominated, absorbing most new verbs.[3] The regular-irregular divide has profound implications for linguistic theory, particularly in debates over whether language relies on discrete rules or connectionist networks of associations.[1] Proponents of the words-and-rules model argue that the asymmetry—productivity for regulars versus storage for irregulars—reflects innate dual mechanisms in the brain, supported by evidence from language acquisition, aphasia, and cross-linguistic comparisons.[1] Connectionist alternatives propose a single associative system that approximates rules through pattern learning, though they struggle to account for the sharp boundary in generalization behaviors.[1] In second-language learning, irregular verbs pose greater challenges due to their unpredictability, often requiring explicit rote memorization.[4]

Definitions and Fundamentals

Regular Verbs

Regular verbs are those that adhere to a standard set of morphological rules when forming their inflected forms, such as the past tense, past participle, and present participle, typically by appending consistent affixes to the verb stem without altering the stem itself. In English, for instance, the past tense and past participle are most commonly formed by adding the suffix -ed to the base form, while the present participle uses -ing, and the third-person singular present tense adds -s or -es. This regularity applies across a wide range of verbs, making their conjugation predictable and governed by phonological and orthographic conventions rather than exceptions. The core characteristics of regular verbs include their reliance on affixation without internal stem modifications, such as vowel shifts (ablaut) or suppletive forms where entirely different roots are used for different tenses. This predictability stems from phonological rules that determine the exact form of the suffix; for example, after voiceless consonants, the -ed suffix is pronounced /t/, after voiced consonants or vowels it is /d/, and after /t/ or /d/ it becomes /ɪd/. Unlike irregular verbs, regular verbs do not exhibit stem changes, ensuring uniformity in inflection patterns that simplifies syntactic integration in sentences. To illustrate, consider the regular verb "walk," which conjugates as follows in the indicative mood:
FormConjugation
Base formwalk
Third-person singularwalks (/wɔːks/)
Past tensewalked (/wɔːkt/)
Past participlewalked (/wɔːkt/)
Present participlewalking (/ˈwɔːkɪŋ/)
Similarly, "play" follows the same pattern:
FormConjugation
Base formplay
Third-person singularplays (/pleɪz/)
Past tenseplayed (/pleɪd/)
Past participleplayed (/pleɪd/)
Present participleplaying (/ˈpleɪɪŋ/)
These examples highlight the consistent application of affixes, with adjustments for phonotactics like the /z/ sound in "plays" due to the preceding vowel. In the broader morphology of languages like English, regular verbs play a crucial role by providing a default, rule-based system that enhances learnability and cognitive processing efficiency for speakers and learners alike. Their standardized forms reduce the cognitive load in language acquisition, as rules can be generalized rather than memorized individually, contributing to the productivity of the verbal system where new verbs (e.g., "google") are often integrated as regular. This contrasts with irregular verbs, which require rote learning due to their deviations.

Irregular Verbs

Irregular verbs deviate from the standard inflectional pattern of English verbs, which typically involves adding the suffix -ed to form the past tense and past participle (as in walk-walked). There are approximately 200 irregular verbs in modern English. Instead, their forms often require memorization because they follow antiquated rules or no predictable rule at all, such as vowel modifications or stem replacements. These include "strong" verbs inherited from Germanic languages that employ ablaut (internal vowel gradation) rather than affixation, as well as "weak" irregulars that exhibit exceptions to the -ed rule.[5][6] The main types of irregularity are ablaut, suppletion, and partial irregularities. Ablaut involves a systematic change in the root vowel to indicate tense, as seen in verbs like sing (present: sing, past: sang, past participle: sung) or fall (present: fall, past: fell, past participle: fallen), reflecting patterns from seven historical classes of strong verbs.[7][5] Suppletion occurs when entirely different roots supplant the original stem for certain forms, exemplified by go (present: go, past: went—from an unrelated Old English verb wende—past participle: gone). Partial irregularities blend elements of regular and irregular patterns, such as have (present: have, past: had, past participle: had), where the past form deviates but follows a somewhat predictable alteration without full affixation.[7][5] A prime example of an irregular verb with extensive irregularity is be, the most suppletive and morphologically complex in English, showing distinct forms across persons, numbers, and tenses due to its fusion of multiple ancestral roots. Its full paradigm in the indicative mood is as follows:
  • Present tense: I am, you are, he/she/it is, we/you/they are
  • Past tense: I/he/she/it was, you/we/they were
  • Past participle: been (used with have/has/had, e.g., I have been)
This verb's forms, totaling over a dozen unique variants when including subjunctive and imperative moods, underscore the depth of irregularity in high-utility auxiliaries.[8][5] Irregular verbs hold functional significance as they predominantly comprise the language's most frequent items—such as be, have, and do—which together account for a substantial portion of verbal usage in frequency databases like CELEX, derived from corpora totaling over 17 million words. This high frequency slows their regularization, with evolutionary models showing that irregular forms persist longer in common verbs (half-life of about 2,000 years for those used once per 1,000–10,000 words) compared to rarer ones (half-life of 300 years for those used once per million words). As relics of Proto-Indo-European and Old English systems, where ablaut was productive across hundreds of verbs (now reduced to about 98 strong irregulars from 177 originals), they illustrate how usage-driven retention preserves exceptions amid ongoing language regularization.[5]

Morphological Patterns

Regular Inflection Patterns

Regular verb inflections typically involve affixation, where morphological markers such as suffixes are added to the verb stem to indicate grammatical categories like tense, aspect, mood, person, and number.[9] These processes often require phonological adjustments to ensure euphonic integration, including assimilation—where adjacent sounds influence each other, such as the voicing of a suffix to match the stem—and epenthesis, the insertion of a vowel to break up consonant clusters, as seen in English past tense forms like "wanted" where /ɪ/ is added after /t/ or /d/. In many languages, these adjustments follow predictable rules governed by the phonology of the stem's final segment, promoting systematicity in inflection.[10] Universal patterns in regular verb inflection include subject-verb agreement, where the verb form aligns in person and number with the subject, and tense/aspect marking, which varies between synthetic languages that fuse multiple categories into a single word via affixes and analytic languages that rely more on auxiliary verbs or particles.[11] Synthetic languages, such as Latin or Russian, express agreement and tense through extensive suffixation on the verb itself, while analytic ones like modern English use limited inflection alongside word order and helpers.[12] These patterns facilitate cross-linguistic comparability, with agreement often marking subject features like person (first, second, third) and number (singular, plural), and tense indicating time relative to the present.[11] In English, the third-person singular present indicative is formed by adding -s (or -es) to the verb stem, with pronunciation varying by the stem's final sound: /s/ after voiceless consonants (e.g., "walks" /wɔːks/), /z/ after voiced sounds or vowels (e.g., "plays" /pleɪz/), and /ɪz/ after sibilants to avoid clustering (e.g., "watches" /wɒtʃɪz/).[13] This rule applies uniformly to regular verbs, ensuring the form signals agreement without altering the stem. In Latin, regular verbs in the first conjugation, such as amō ("I love"), follow a paradigm in the present indicative active: first person singular -ō (amō), second singular -ās (amās), third singular -at (amat), first plural -āmus (amāmus), second plural -ātis (amātis), and third plural -ant (amant), derived by attaching these endings directly to the stem am- after removing the infinitive's -āre.[14] Second conjugation verbs like moneō ("I warn") use -eō, -ēs, -et, etc., on the stem mone-, with stem vowel adjustments to maintain distinction across conjugations.[15] The productivity of regular patterns is evident in how neologisms and borrowed verbs readily adopt them, as the default rules apply without exception; for instance, the verb "email" inflects as "emails" in third-person singular and "emailed" in past tense, demonstrating the system's extensibility to new lexical items. Irregular deviations serve merely as exceptions to these productive rules, highlighting the regularity's dominance in novel formations.[16]

Irregular Inflection Patterns

Irregular verbs deviate from the standard affixation seen in regular verbs by employing stem-internal modifications or replacements to mark tense and aspect. These patterns, inherited from Proto-Indo-European and adapted in daughter languages, include vowel alternations, complete stem substitutions, and copying mechanisms that create non-predictable forms.[17] Ablaut, or apophony, involves systematic vowel gradations within the verb stem to distinguish grammatical categories such as present, preterite, and past participle, a hallmark of Indo-European strong verbs. In Germanic languages, this manifests in seven historical classes, each with distinct vowel patterns derived from Proto-Germanic ablaut series; for instance, Class III features patterns like i-a-u, as in English drink-drank-drunk. These alternations reflect ablaut grades (e.g., full, zero) from Proto-Indo-European, where vowel quality encoded tense without suffixes.[17][18] Suppletion represents a more extreme irregularity, where different roots—etymologically unrelated or distantly related—replace the stem across paradigm cells, often due to historical mergers of verbs. In Romance languages, this is evident in French aller ('to go'), which combines forms from Latin ambulare, vadere, and ire: present vais-allons, future irai, past participle allé. Such patterns are constrained by paradigm dependencies, limiting suppletive allomorphy to specific zones like the present tense.[19][20] Reduplication, a rarer pattern involving partial or full copying of the stem, appears in Indo-European and Niger-Congo languages to signal aspect or intensity. In Sanskrit, perfect forms use CV reduplication, as in dvis ('hate') yielding di-dvis-a, while intensive forms employ a heavy Ca(X) template, simplifying onsets: krand ('cry out') becomes kan-i-krand. In Bantu languages, verb-stem reduplication often conveys iterativity or distributivity through full copying, including suffixes; for example, in Ciyao, telek-a ('cook') reduplicates to telek-a + telek-a ('cook frequently'). These mechanisms preserve prosodic structure while altering meaning.[21][22] Irregular verbs are classified into strong and weak categories in Germanic linguistics, with strong verbs relying on ablaut for preterite formation (e.g., sing-sang-sung) and weak verbs adding a dental suffix despite irregularities (e.g., think-thought). A distinct subclass, preterite-present verbs, originates from Proto-Germanic modals whose present forms derive from old strong preterites, resulting in weak preterite endings but irregular presents: English can-could, may-might. This classification highlights ablaut's persistence amid analogical pressures from weak patterns.[18][23]

Orthographic Irregularities

Orthographic irregularities in verb inflections arise when spelling conventions create apparent deviations from standard patterns without altering the underlying morphological structure, often due to English's non-phonetic writing system. These occur in regular verbs where the past tense or participle is formed by adding -ed or -d, but orthographic rules adjust the spelling to preserve pronunciation or historical forms. For instance, verbs ending in a single consonant following a short vowel double the consonant before adding -ed or -ing to indicate that the preceding vowel remains short, as in "stop" becoming "stopped" rather than "stoped."[24] Another common orthographic adjustment involves the y-to-i rule, where verbs ending in a consonant plus -y change the -y to -i before adding -ed, such as "study" to "studied," to avoid awkward spelling sequences while keeping the regular -ed suffix. This rule applies only when the suffix begins with a vowel other than -i; for -ing, the -y remains, as in "studying."[24] Such mismatches between spelling and sound, including silent letters or vowel shifts not reflected in writing, can make inflections seem irregular at first glance. A notable example is the verb "dream," which follows the regular pattern as "dreamed" but also accepts "dreamt" as an alternative past form, where the -t ending reflects a historical orthographic preference for brevity over the standard -ed, leading to dual acceptable spellings in modern usage.[25] These orthographic features pose significant challenges for language learners, as English's deep orthography—characterized by inconsistent grapheme-phoneme mappings—slows reading acquisition and increases error rates in verb conjugation compared to languages with shallower orthographies.[26] For bilingual learners, exposure to English's irregularities can interfere with processing speed, with studies showing slower reading times in opaque orthographies like English due to orthographic opacity, exacerbating confusion in verb forms where spelling does not reliably signal pronunciation changes.[27] This non-phonetic system demands greater reliance on memorization and morphological awareness, contributing to higher rates of spelling errors in inflected verbs among non-native speakers.[26]

Historical Development

Origins in Proto-Indo-European

The Proto-Indo-European (PIE) language, spoken approximately between 4500 and 2500 BCE in the Pontic-Caspian steppe region, featured a complex verbal system characterized by aspectual distinctions and intricate morphological patterns.[28] This system relied heavily on ablaut—a process of vowel gradation within roots and affixes—to encode grammatical categories such as tense, mood, and number. Ablaut operated through seven grades: zero-grade (∅), e-grade (*e), o-grade (*o), lengthened e-grade (*ē), lengthened o-grade (*ō), laryngeal-influenced long vowels (e.g., *ā, *ī), and compensatory lengthening from laryngeal loss.[29] These grades allowed for systematic alternations, as seen in root forms where *e could shift to *o in certain contexts or disappear entirely in zero-grade, marking differences between strong and weak stems within paradigms.[30] PIE verbs were conjugated in two primary classes: athematic and thematic. Athematic verbs, the older type, lacked a connecting vowel and directly attached endings to the root, resulting in frequent ablaut alternations across the paradigm; for example, the root *h₁es- "be" showed *h₁és-mi (1sg present) versus *h₁s-ntí (3pl present).[31] Thematic verbs, by contrast, inserted a thematic vowel *e/o (ablauting to zero only outside the paradigm), which stabilized forms and reduced internal vowel variation, as in *bʰér-e-ti "carries" (1sg present) from the root *bʰer- "carry."[32] This distinction arose from morphological innovations, with thematic formations likely emerging to simplify conjugation for newer or derived verbs.[33] Reconstruction of the PIE verbal system relies on comparative linguistics, analyzing cognates across daughter languages like Sanskrit, Ancient Greek, Latin, and Hittite to posit proto-forms. For instance, the root *bʰer- "carry" is reconstructed with forms such as *bʰér-o-m (1sg present thematic), *bʰér-o-nt-i (3pl present), and *bʰŕ̥-né-h₂ (perfect middle), evidenced by reflexes like Sanskrit bhárati, Greek phérei, and Latin ferō.[34] These comparisons reveal how ablaut patterned with accent: full-grade (*e/o) in accented syllables and zero-grade in unaccented ones, preserving archaic features in athematic paradigms.[31] The origins of the regular-irregular divide trace to this athematic (precursor to "strong" or irregular) versus thematic (precursor to "weak" or regular) split, where ablaut-based inflection in athematic verbs created non-suffixal alternations, while thematic verbs used affixal endings with minimal root change.[33] The laryngeal theory further elucidates these vowel alternations, positing three laryngeals (*h₁, *h₂, *h₃) that colored adjacent vowels (*h₂e > *a, *h₃e > *o) or lengthened them upon disappearance (e.g., *eh₁ > *ē), as confirmed by Hittite evidence like išḫāi "binds" from *h₂eǵ- with laryngeal preservation.[30] This theory, initially proposed by Saussure in 1879 and validated in the 20th century, accounts for otherwise irregular vocalisms in ablaut series, such as Greek títhēmi "places" from *dʰeh₁-.[30]

Evolution in Germanic Languages

The evolution of verb inflection in Germanic languages built upon the ablaut-based system inherited from Proto-Indo-European (PIE), where vowel alternations marked tense and mood in strong verbs. In Proto-Germanic, this system was reorganized into seven classes of strong verbs, characterized by stem vowel changes (ablaut) without additional suffixes for the preterite, as seen in Old English examples like singan (infinitive), sang (preterite singular), and sungen (past participle) from class III. Weak verbs, an innovation in Germanic, emerged alongside these, employing a dental suffix (-d- or -t-) added to the stem to form the preterite, as in Old English lufian (to love), lufode (loved). This distinction arose around the 1st millennium BCE, with weak verbs likely originating from causatives or denominatives in PIE that adopted the dental formative for simplicity.[35] Sound shifts profoundly shaped these patterns, notably Grimm's Law, which systematically altered PIE stop consonants into fricatives or voiced stops in Proto-Germanic, influencing the consonantal environment around ablaut vowels and thus the overall verb morphology. For instance, the PIE root *h₁ed- ("eat") underwent these changes: the e-grade became Proto-Germanic *etaną (infinitive), while the preterite singular *ēt (lengthened grade, anomalous) yielded Old English ēt or ǣt (ate singular) and ēton (plural). These shifts, occurring between 500 BCE and 1 CE, disrupted uniform ablaut series across classes, leading to mergers and analogical leveling in daughter languages such as Old High German and Old Norse. By the early medieval period, such phonological pressures accelerated the differentiation between strong and weak paradigms.[36][37] Historical events further drove changes, particularly in English, where Viking invasions from the late 8th to 11th centuries introduced Old Norse, a closely related Germanic language, fostering bilingualism and syntactic simplification that indirectly affected verb forms. This contact in northern and eastern England promoted the leveling of inflections, hastening the shift of some strong verbs to weak patterns through analogy and reduced morphological complexity, as Old Norse shared similar strong verb classes but favored periphrastic constructions. Over time, strong verbs declined: Old English had approximately 333 strong verbs, but by the Middle English period (1150–1500 CE), over 90 had disappeared, 81 shifted to weak conjugation, and only 68 remained fully strong, with further losses in Early Modern English.[38][39] By the 5th to 11th centuries CE, weak verbs had become the dominant and productive class across Germanic languages, applied to new coinages and borrowings due to their straightforward suffixation, while high-frequency strong verbs like beon (to be) persisted as relics. This rise was fueled by type and token frequency effects, where less frequent strong verbs succumbed to regularization, stabilizing the modern regular-irregular divide seen in languages like German (gehen - weak) versus English (go-went - strong). The weak paradigm's universality thus marked a key Germanic innovation, prioritizing morphological economy over inherited ablaut complexity.[18]

Linguistic Perspectives

Theoretical Approaches

In generative grammar, as developed by Noam Chomsky, the distinction between regular and irregular verbs is framed within a modular architecture where productive rules generate systematic forms, while irregulars are treated as exceptions stored in the lexicon. Chomsky's early frameworks, such as those in Aspects of the Theory of Syntax (1965), posit that the grammar consists of a finite set of rules operating on lexical items to produce infinite structures, with the lexicon serving as a repository for idiosyncratic forms that evade full rule application. Irregular verbs, like go-went, are thus memorized as whole units or tagged with specific transformations, preventing overgeneralization of regular rules (e.g., -ed suffixation for past tense). This rules-versus-lexicon dichotomy underscores the productivity of regular inflection, where rules apply productively to novel forms, contrasting with the rote storage of irregulars to account for their persistence despite pressure toward regularization. Optimality Theory (OT), introduced by Alan Prince and Paul Smolensky, approaches verb inflection through a system of ranked, violable constraints that evaluate candidate outputs for the optimal form, balancing faithfulness to the input against markedness preferences for simpler structures. In morphological applications, faithfulness constraints (e.g., MAX-IO, preserving input segments) compete with markedness constraints (e.g., *COMPLEX, penalizing added material like suffixes), allowing regular inflections to emerge as optimal for most verbs while irregular forms arise when higher-ranked faithfulness preserves historical alternations, as in English sing-sang. For instance, in subregular classes like hide-hid, OT posits constraint interactions where partial faithfulness to ablaut patterns outranks full regularization, explaining the coexistence of patterns without invoking separate rule blocks. This framework, extended to morphology in works like Russell (1997), highlights how universal constraints interact with language-specific rankings to maintain irregularity in high-frequency items.[40][41] Usage-based models, particularly Joan Bybee's network approach, view regular and irregular verb patterns as emergent from interconnected lexical representations shaped by token and type frequency, rather than innate rules. In Bybee's schema model, verbs cluster in networks based on phonological and semantic similarity, with high-frequency irregulars (e.g., be-was) forming strong, entrenched connections that resist change, while low-frequency forms analogize to productive schemas like the regular past tense. These schemas—abstract generalizations over stored exemplars—emerge dynamically, enabling productivity through partial generalizations, such as extending -ed to novel verbs. Bybee (1995) argues that irregularity persists in dense network regions, where multiple exemplars reinforce non-default patterns, but erodes elsewhere via leveling toward dominant schemas.[42] Central to these theories is the productivity of regular verbs, defined as their ability to generate novel forms without lexical exceptions, as quantified by the Tolerance Principle where a rule remains productive if exceptions (M) fall below a threshold relative to total instances (N), such as M < N/ln N for English past tense (Yang, 2005). Analogical leveling further explains the historical reduction of irregularities, as irregular forms conform to regular patterns through proportion exchange (e.g., dive-dove to dive-dived), driven by the dominance of productive rules in sparse network areas. This process, observed across Indo-European languages, aligns generative and usage-based views by positing that while irregulars are lexically entrenched, systemic pressures favor regularization over time.[43][44]

Psycholinguistic and Frequency Effects

In the acquisition of verb morphology, children initially produce correct forms of irregular verbs through rote memorization but later overregularize them by applying the default past-tense rule, such as saying "goed" instead of "went" or "comed" instead of "came."[45] This pattern exemplifies a U-shaped learning curve, where accuracy declines temporarily as children generalize the regular -ed suffix before recovering the irregular forms through further exposure and reinforcement.[46] Such overregularizations peak around ages 3-4 in English-speaking children and are less frequent for high-frequency irregulars, reflecting the interplay between rule abstraction and lexical storage during early language development.[47] Psycholinguistic processing of regular and irregular verbs is often explained by dual-route models, which posit a rule-based pathway for regulars—combining stems with affixes like -ed via symbolic computation—and an associative pathway for irregulars, relying on direct lexical retrieval from memory.[1] Functional magnetic resonance imaging (fMRI) studies support this distinction, showing greater activation in frontal regions, such as the left inferior frontal gyrus, for regular inflections due to rule application, while irregulars engage temporal and parietal areas associated with declarative memory retrieval.[48] For instance, processing novel regular forms like "wugged" activates procedural memory networks more than familiar irregulars like "kept," highlighting domain-specific neural substrates for morphological computation.[49] Frequency profoundly influences the stability of irregular verbs, with high-frequency forms resisting regularization over time while low-frequency ones are more prone to it. In English, irregular verbs constitute only about 3% of the total verb lexicon but account for a disproportionately large share of usage, including the ten most common verbs (e.g., "be," "have," "do"), which reinforces their irregular patterns through repeated exposure.[5] High-frequency irregulars like "keep-kept" remain stable, whereas low-frequency ones like "sweep" occasionally appear in regularized forms such as "sweeped" in modern corpora, accelerating their potential shift toward regularity.[5] Cross-linguistically, similar patterns emerge in Germanic and Romance languages, where irregulars form a small proportion of verbs but dominate token frequency, aiding their preservation despite pressures toward analogical regularization.

Cross-Linguistic Examples

English Verbs

In English, verbs are broadly divided into weak (regular) and strong (irregular) categories based on how they form the past tense and past participle. Weak verbs, which constitute the majority, add the suffix -ed (or -d after verbs ending in -e) to the base form for both the simple past and past participle, as in walk (base), walked (past), walked (past participle). This pattern, known as the dental preterite, applies to over 97% of modern English verbs and reflects a productive rule that allows new verbs to integrate easily. Strong verbs, by contrast, rely on internal vowel alternation (ablaut) or other non-suffix changes, such as come (base), came (past), come (past participle), without the -ed ending. These irregularities stem from inherited patterns in Proto-Germanic, where vowel gradation marked tense. English has approximately 200 irregular verbs in common use, representing less than 3% of the total verb inventory, though they dominate high-frequency contexts.[5][5] The most frequent irregular verbs are disproportionately represented in everyday speech and writing, with the top ten all exhibiting irregular conjugation patterns. This skew toward irregularity among common verbs underscores their resistance to regularization despite the dominance of weak forms overall. Below is a table of the ten most frequent irregular verbs, ranked by corpus-based usage frequency, along with their principal parts (base, simple past, past participle). These forms are essential for basic communication, as verbs like be, have, and do appear in nearly every sentence involving tense or aspect.[50]
RankBase FormSimple PastPast Participle
1bewas/werebeen
2havehadhad
3dodiddone
4saysaidsaid
5getgotgot/gotten
6makemademade
7gowentgone
8knowknewknown
9taketooktaken
10seesawseen
Conjugations for these verbs often include additional irregularities, such as suppletive forms (go-went-gone) or lack of infinitives. Extending to the top 50 by frequency, patterns repeat across classes like Class 1 (ride-rode-ridden) and Class 7 (fall-fell-fallen), but full lists reveal about 98 persistent strong verbs from Old English origins.[5][51] Historically, the Norman Conquest of 1066 accelerated morphological simplification in English verbs, as contact with Norman French reduced complex inflections and favored analytic structures over synthetic ones, indirectly promoting the -ed regularization for many weak verbs while preserving core strong forms among high-frequency items. This period saw the loss of some Old English strong verbs through analogy to weak patterns, contributing to the current inventory of around 200 irregulars. In contemporary English, regularization continues as a dynamic process, with less frequent irregular verbs increasingly adopting -ed forms; for instance, dive traditionally uses the regular dived (past and past participle) since the 14th century, though the irregular variant dove emerged in American English by the 19th century and persists in informal speech, illustrating ongoing variation.[52][53][54] English exhibits unique aspects in its irregular verb system through phrasal verbs and modal auxiliaries, which introduce semi-irregular behaviors. Phrasal verbs, such as give up or look after, conjugate according to the base verb's pattern—regular if the base is weak (pick up-picked up-picked up) or irregular if strong (set off-set off-set off, with no change)—but their idiomatic meanings and separable particles create perceived irregularity in usage and syntax, as particles may shift position in certain constructions (e.g., turn the light off vs. turn off the light). Modal verbs like can, must, and shall are highly defective and irregular, lacking full paradigms: they have no -s inflection in the third person singular (he can, not he cans), no participles or infinitives, and irregular past forms via suppletion (can-could, will-would). This defectiveness limits their tense flexibility, often requiring periphrastic alternatives (e.g., be able to for non-past can). These features highlight English's blend of Germanic irregularity with analytic tendencies.[55][56]

Verbs in Romance and Other Indo-European Languages

In Romance languages, verb inflection largely derives from Latin's four conjugations, which provide a systematic regular base characterized by distinct stem vowels (-ā-, -ē-, -ĕ-, -ī-) in the present infinitive and consistent patterns across tenses.[57] These conjugations persist in modern descendants like Spanish, French, and Italian, where most verbs follow predictable endings for person, number, tense, and mood, though with phonological adaptations. Irregularities arise primarily in high-frequency verbs, often through stem changes or suppletion, reflecting historical mergers and sound shifts from Vulgar Latin. Stem-changing verbs represent a common irregularity in Romance, particularly in the present tense of Spanish and Portuguese, where the stem vowel diphthongizes or alters in certain forms due to stress patterns inherited from Latin. For example, the Spanish verb pensar ('to think') changes from pens- to piens- in the first- and third-person singular and plural present indicative (pienso, piensa, piensan), a semi-regular pattern seen in verbs with mid-vowels under stress.[58] Suppletive forms, where unrelated stems combine to form a single paradigm, are typical of core verbs like the French copula être ('to be'), which merges elements from Latin esse ('to be') and stāre ('to stand'), yielding irregular roots such as suis (from sum), es (from es), and fut (from a suppletive future in fuī).[59] Beyond Romance, other Indo-European branches exhibit family-specific patterns blending regularity and irregularity. In Slavic languages, aspectual pairs—imperfective and perfective forms of the same verb—are often formed regularly via prefixes on the imperfective stem, encoding completion or iteration without altering the root; for instance, Russian čitatʹ ('to read', imperfective) pairs with prefixed perfectives like pro-čitatʹ ('to read through') or za-čitatʹ ('to start reading'), where prefixes function as lexico-grammatical classifiers unitizing events.[60] Ancient Greek aorist stems, marking simple past actions, typically add a regular augment (ἐ-) to the verb root in the indicative, but irregularities occur in second aorists, which lack the sigmatic -σα- marker and use root-modified forms, as in ἔλαβον ('I took') from λαμβάνω ('to take'), reflecting athematic or vowel-grade alternations from Proto-Indo-European.[61] Cross-branch comparisons reveal that Latin's four conjugations form a relatively regular foundation, with irregularities concentrated in 20-30% of verbs across many Indo-European languages, predominantly high-frequency ones that resist regularization due to usage. In Italian, a Romance example, andare ('to go') shows irregularity across tenses, with present forms like vado deriving from Latin vadō and past andai from a suppletive blend involving ambulare and vadere, yielding the participle andato. Similarly, in Indo-Aryan Hindi, the verb jānā ('to go') employs periphrastic constructions for perfect aspects, combining the root participle gayā (from an older perfect stem) with auxiliaries like hai ('is') to form gayā hai ('has gone'), diverging from simpler transitive patterns.[62] These patterns parallel English irregularities, such as go-went, in prioritizing frequent verbs for suppletive or altered forms.

Verbs in Non-Indo-European Languages

In non-Indo-European languages, verb systems often prioritize regularity through agglutinative, templatic, or analytic strategies, leading to typological patterns that contrast with the fusional irregularities prevalent in Indo-European families. These structures typically apply consistent morphological rules across large portions of the lexicon, resulting in fewer exceptions and more predictable conjugation paradigms. For instance, agglutinative languages build verb forms by sequentially adding affixes to stems, while templatic systems derive forms from abstract root patterns, both minimizing the need for suppletive or stem-changing irregularities.[63] Turkic languages, part of the proposed Altaic family, exemplify agglutinative regularity in verb morphology, where suffixes denote tense, person, number, and mood, harmonizing vowels with the stem for phonological coherence. Turkish verbs, in particular, exhibit almost no irregularities, as conjugation relies on invariant stems and predictable affixation; the verb git- ("go") forms the first-person past as gittim via standard suffix -tim. Exceptions are rare and limited to auxiliary-like verbs such as olmak ("be"), which may alter slightly in certain contexts, but the overall system remains highly uniform.[64] Semitic languages employ a root-and-pattern morphology that generates verb forms systematically from consonantal roots, typically triconsonantal, ensuring regularity in derivation and inflection. In Arabic, strong verbs follow fixed templates across tenses; for example, the root k-t-b ("write") yields the perfect kataba ("he wrote") and imperfect yaktubu ("he writes") through consistent vowel insertion and prefixation. Irregularities arise primarily in weak verbs with radicals like ʔ (hamza) or w/y, leading to assimilation or elision akin to broken plurals in nominal morphology, such as shortened forms in defective roots, though these affect only a subset of the lexicon.[65] Austronesian languages frequently use reduplication as a regular derivational device for marking verbal aspect, plurality, or iteration, with partial or full copying of the stem applied predictably to convey nuances like progressive or distributive actions. In languages such as Tagalog, reduplication of the initial syllable signals imperfective aspect, as in ta-takbo ("running") from takbo ("run"), maintaining stem integrity without exception. Suppletive pronouns, which vary in form across cases or numbers, often cliticize directly to verbs, influencing agreement and introducing minor allomorphic adjustments, but this does not disrupt the core regularity of the verbal paradigm.[66] Across these families, the prevalence of agglutinative and templatic mechanisms results in few true irregular verbs, with exceptions confined to high-frequency or auxiliary items, fostering overall lower irregularity compared to fusional systems.[67]

Constructed Languages

Constructed languages, or conlangs, often feature verb systems engineered for predictability and ease of acquisition, contrasting with the organic irregularities of natural languages. In auxiliary languages (auxlangs) and logical languages (loglangs), creators prioritize full regularity to facilitate international communication or precise expression, eliminating exceptions that complicate learning. Artistic languages (artlangs), however, may incorporate irregularities to evoke aesthetic depth or mimic historical evolution, such as Indo-European ablaut patterns. This intentional design reflects broader principles of linguistic engineering, where regularization enhances learnability while exceptions add cultural or narrative flavor.[68] Esperanto exemplifies maximal regularization in verb morphology, with all verbs following a uniform paradigm regardless of person, number, or root. The system uses fixed endings for tenses and moods: -as for present indicative (e.g., mi parolas, "I speak"), -is for past (mi parolis, "I spoke"), -os for future (mi parolos, "I will speak"), and -us for conditional (mi parolus, "I would speak"). There are no irregular verbs, a deliberate choice by creator L. L. Zamenhof to promote accessibility and rapid mastery among diverse speakers. This uniformity stems from Esperanto's auxlang goals, avoiding the memorization burdens of natural language exceptions.[69][70] Other conlangs display varied approaches to verb regularity. Volapük, an early auxlang by Johann Martin Schleyer, employs a highly inflected but consistent verb system with prefixes for tense and suffixes for person and number, though its complexity in combining affixes can introduce partial irregularities in derivation and usage. In contrast, J. R. R. Tolkien's Quenya, an artlang inspired by ancient tongues, retains echoes of Indo-European ablaut in strong verbs, where past tenses involve vowel changes (e.g., car- "to do" becomes carë in past), blending regularity with archaic irregularity for mythological authenticity. Lojban, a loglang rooted in predicate logic, treats "verbs" as invariant predicates without traditional conjugation; relations like tense or aspect are expressed via separate words (e.g., broda "is x1 related to x2 by property bridi"), ensuring complete regularity and unambiguity in logical structure.[71][72] Klingon, the artlang developed by Marc Okrand for the Star Trek universe, maintains regular verb conjugation through a system of prefixes for subject-object agreement and suffixes for tense/aspect (e.g., jIH tlhIngan Hol DajontaH "I am learning Klingon"), with no stem-altering irregularities to emphasize its alien yet learnable design. Across these examples, loglangs like Lojban and auxlangs like Esperanto favor exception-free systems to minimize cognitive load, while artlangs like Quenya and Klingon permit controlled irregularities for expressive or immersive purposes, highlighting the spectrum of conlang design philosophies.[73][74]

References

User Avatar
No comments yet.