Syllabification

Syllabification (/sɪˌlæbɪfɪˈkeɪʃən/) or syllabication (/sɪˌlæbɪˈkeɪʃən/) is the separation of a word into syllables, whether spoken, written^[1] or signed.^[2] Separating written words into syllables is called 'hyphenation'.

Overview

The written separation into syllables, that is 'hyphenation', is usually marked by a hyphen when using English orthography (e.g., syl-la-ble) and with a period when transcribing the actually spoken syllables in the International Phonetic Alphabet (e.g., [ˈsɪl.ə.bᵊɫ]). For presentation purposes, typographers may use an interpunct (Unicode character U+00B7, e.g., syl·la·ble), a special-purpose "hyphenation point" (U+2027, e.g., syl‧la‧ble), or a space (e.g., syl la ble).

At the end of a line, a word is separated in writing into parts, conventionally called "syllables", if it does not fit the line and if moving it to the next line would make the first line much shorter than the others. This can be a particular problem with very long words, and with narrow columns in newspapers. Word processing has automated the process of justification, making syllabification of shorter words often unnecessary.

In some languages, the spoken syllables are also the basis of syllabification in writing. However, possibly due to the weak correspondence between sounds and letters in the spelling of modern English, written syllabification in English is based mostly on etymological or morphological, instead of phonetic, principles. For example, it is not possible to syllabify "learning" as lear-ning according to the correct syllabification of the living language. Seeing only lear- at the end of a line might mislead the reader into pronouncing the word incorrectly, as the digraph ea can hold many different values. The history of English orthography accounts for such phenomena.

English written syllabification therefore deals with a concept of "syllable" that does not correspond to the linguistic concept of a phonological (as opposed to morphological) unit.

As a result, even most native English speakers are unable to syllabify words according to established rules without consulting a dictionary or using a word processor. Schools usually do not provide much more advice on the topic than to consult a dictionary. In addition, there are differences between British and US syllabification and even between dictionaries of the same English variety.

In Finnish, Italian, Portuguese, Japanese (Romaji), Korean (Romanized) and other nearly phonemically spelled languages, writers can in principle correctly syllabify any existing or newly created word using only general rules. In Finland, children are first taught to hyphenate every word until they produce the correct syllabification reliably, after which the hyphens can be omitted.

Algorithm

Unsolved problem in computer science

Is there any perfect syllabification algorithm in English language?

More unsolved problems in computer science

A hyphenation algorithm is a set of rules, especially one codified for implementation in a computer program, that decides at which points a word can be broken over two lines with a hyphen. For example, a hyphenation algorithm might decide that impeachment can be broken as impeach-ment or im-peachment but not impe-achment.

One of the reasons for the complexity of the rules of word-breaking is that different dialects of English tend to differ on hyphenation: American English tends to work on sound, but British English tends to look to the origins of the word and then to sound.^{[citation needed]} There is also a large number of exceptions, which further complicates matters.^{[citation needed]}

Among the algorithmic approaches to hyphenation, the one implemented in the TeX typesetting system is widely used. It is thoroughly documented in the first two volumes of Computers and Typesetting by Donald Knuth and in Franklin Mark Liang's dissertation.^[3] The aim of Liang's work was to get the algorithm as accurate as possible and to keep exceptions to a minimum.

In TeX's original hyphenation patterns for American English, the exception list contains only 14 words.^[4]

In TeX

Ports of the TeX hyphenation algorithm are available as libraries for several programming languages, including Haskell, JavaScript, Perl, PostScript, Python, Ruby, C#, and TeX can be made to show hyphens in the log by the command \showhyphens.

In LaTeX, hyphenation correction can be added by users by using:

\hyphenation{words}

The \hyphenation command declares allowed hyphenation points in which words is a list of words, separated by spaces, in which each hyphenation point is indicated by a - character. For example,

\hyphenation{fortran er-go-no-mic}

declares that in the current job "fortran" should not be hyphenated and that if "ergonomic" must be hyphenated, it will be at one of the indicated points.^[5]

However, there are several limits. For example, the stock \hyphenation command accepts only ASCII letters by default and so it cannot be used to correct hyphenation for words with non-ASCII characters (like ä, é, ç), which are very common in many languages. Simple workarounds exist, however.^[6]^[7]

Notes

^ The term is also used for the process of a consonant becoming syllabic. For example, in North Central American English, "can" may be pronounced [kən], or [kn̩] with the a syllabic /n/.
^ Baus, C.; Gutiérrez, E.; Carreiras, M. (13 November 2014). "The role of syllables in sign language production". Frontiers in Psychology. 5: 1254. doi:10.3389/fpsyg.2014.01254. PMC 4230165. PMID 25431562.
^ Liang, Franklin Mark (August 1983). Word Hy-phen-a-tion by Com-pu-ter (PhD). Department of Computer Science, Stanford University. STAN-CS-83-977.
^ "The Plain TeX hyphenation tables" (PDF). Retrieved 23 June 2009.
^ Green, Sheldon (5 June 1995). "\hyphenation". Hypertext Help with LaTeX. Yale Image Processing and Analysis Group. Archived from the original on 27 November 2023.
^ "Accented words aren't hyphenated". TeX FAQ. Archived from the original on 28 November 2023.
^ "How does hyphenation work in TeX?". TeX FAQ. Archived from the original on 27 November 2023.

External links

Online Lyric Hyphenator: Hyphenates English text into syllables
Hyphenation tool for the French Language: Hyphenates French words with explanation

[1] The term is also used for the process of a consonant becoming syllabic. For example, in North Central American English, "can" may be pronounced [kən], or [kn̩] with the a syllabic /n/.

[2] Baus, C.; Gutiérrez, E.; Carreiras, M. (13 November 2014). "The role of syllables in sign language production". Frontiers in Psychology. 5: 1254. doi:10.3389/fpsyg.2014.01254. PMC 4230165. PMID 25431562.

[3] Liang, Franklin Mark (August 1983). Word Hy-phen-a-tion by Com-pu-ter (PhD). Department of Computer Science, Stanford University. STAN-CS-83-977.

[4] "The Plain TeX hyphenation tables" (PDF). Retrieved 23 June 2009.

[5] Green, Sheldon (5 June 1995). "\hyphenation". Hypertext Help with LaTeX. Yale Image Processing and Analysis Group. Archived from the original on 27 November 2023.

[6] "Accented words aren't hyphenated". TeX FAQ. Archived from the original on 28 November 2023.

[7] "How does hyphenation work in TeX?". TeX FAQ. Archived from the original on 27 November 2023.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

Component	Onset	Rhyme
		Nucleus
Example: "street" (/striːt/)	/str/	/iː/

Sound Class	Sonority Level	Examples
Vowels	9	/a/, /i/
Glides	8	/j/, /w/
Liquids	7	/l/, /r/
Nasals	6	/m/, /n/
Voiced Fricatives	5	/v/, /z/
Voiced Stops	4	/b/, /d/
Voiceless Fricatives	3	/f/, /s/
Voiceless Stops	1	/p/, /t/

Pattern Type	Description and Rule Application	Examples
Prefixes	Divisions after prefix, often VCV or at morpheme boundary	in-ter-na-tion-al; un-hap-py
Suffixes	Splits before suffix, respecting closed/open syllables	hap-pi-ness; teach-er
Multisyllabic Words	Combine VCCV/VCV with onset maximization for clusters	in-ter-na-tion-al; bas-ket-ball

History

Syllabification

Recent from talks

Recent from talks

Contribute something

Contribute something

Media Pages

Timelines

Articles

Notes collections

Notes

Notes

Days in Chronicle

Syllabification

Overview

Algorithm

In TeX

See also

Notes

External links

Syllabification

Fundamentals

Definition and Purpose

Basic Syllable Components

Phonological Principles

Syllable Formation Rules

Sonority Hierarchy

Language-Specific Rules

English Syllabification

Rules in Romance Languages

Variations in Non-Indo-European Languages

Computational Approaches

General Algorithms

Hyphenation in TeX

Applications and Implications

Educational Uses

Typographic and Linguistic Analysis

References

Add your contribution

Related Hubs

Contribute something