IBM alignment models
The IBM alignment models are a sequence of increasingly complex models used in statistical machine translation to train a translation model and an alignment model, starting with lexical translation probabilities and moving to reordering and word duplication. They underpinned the majority of statistical machine translation systems for almost twenty years starting in the early 1990s, until neural machine translation began to dominate. These models offer principled probabilistic formulation and (mostly) tractable inference.
The IBM alignment models were published in parts in 1988 and 1990, and the entire series was published in 1993. Every author of the 1993 paper subsequently went to work at the hedge fund Renaissance Technologies.
The original work on statistical machine translation at IBM proposed five models, and a model 6 was proposed later. The sequence of the six models can be summarized as:

Model 1: lexical translation
Model 2: adds an absolute alignment model
Model 3: adds a fertility model
Model 4: adds a relative alignment model
Model 5: fixes a deficiency in Model 4
Model 6: Model 4 combined with a hidden Markov alignment model in a log-linear way
The IBM alignment models treat translation as a conditional probability model. For each source-language ("foreign") sentence f, we generate both a target-language ("English") sentence e and an alignment a. The problem then is to find a good statistical model for p(e, a | f), the probability that we would generate English sentence e and alignment a given a foreign sentence f.
The meaning of an alignment grows increasingly complicated as the model version number increases. See Model 1 for the simplest and most understandable version.
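To make the probability model concrete: Model 1 assumes each English word e_i is generated independently from the foreign word it aligns to, f_a(i), giving p(e, a | f) = ε / (l_f + 1)^l_e · ∏ t(e_i | f_a(i)), where t is a lexical translation table. A minimal sketch in Python, with a made-up toy table (the probabilities below are illustrative assumptions, not trained values):

```python
# Sketch of IBM Model 1 scoring: p(e, a | f) = eps / (l_f + 1)^l_e * prod t(e_i | f_a(i)).
# The translation table `t` is a toy assumption, not values trained from data.

EPS = 1.0  # normalization constant; it cancels when comparing alignments

# toy lexical translation probabilities t(english_word | foreign_word)
t = {
    ("rain", "雨"): 0.9,
    ("tomorrow", "明日"): 0.8,
    ("surely", "きっと"): 0.7,
}

def model1_prob(english, foreign, alignment, t, eps=EPS, default=1e-6):
    """Score p(e, a | f) under Model 1.

    alignment[i] gives, for English position i (0-based), the foreign
    position it aligns to; None stands for the NULL word.
    """
    l_f, l_e = len(foreign), len(english)
    prob = eps / (l_f + 1) ** l_e
    for i, e_word in enumerate(english):
        f_word = None if alignment[i] is None else foreign[alignment[i]]
        prob *= t.get((e_word, f_word), default)
    return prob

english = ["rain", "tomorrow"]
foreign = ["明日", "雨"]
print(model1_prob(english, foreign, [1, 0], t))  # correct pairing scores high
print(model1_prob(english, foreign, [0, 1], t))  # swapped pairing scores low
```

Because the prefactor ε / (l_f + 1)^l_e is the same for every alignment of a given sentence pair, only the product of lexical probabilities matters when comparing alignments.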
Given any foreign-English sentence pair (f, e) of lengths l_f and l_e, an alignment for the sentence pair is a function of type a: {1, ..., l_e} → {0, ..., l_f}, where position 0 stands for the empty NULL word, used for English words with no foreign counterpart. That is, we assume that the English word at location i is "explained" by the foreign word at location a(i). For example, consider the following pair of sentences:
It will surely rain tomorrow -- 明日 は きっと 雨 だ
We can align some English words to corresponding Japanese words, but not every word:
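One straightforward way to represent such an alignment in code is as a mapping from English positions to foreign positions, with 0 reserved for the NULL word. The sketch below encodes the example pair above (the particular dict encoding is an illustrative choice, not a standard data structure):

```python
# An alignment a: {1, ..., l_e} -> {0, ..., l_f}: every English position maps
# to exactly one foreign position, with 0 reserved for the NULL word.
english = ["It", "will", "surely", "rain", "tomorrow"]
japanese = ["明日", "は", "きっと", "雨", "だ"]

# 1-based alignment for the sentence pair above; "It" and "will" have no
# Japanese counterpart, so they align to NULL (position 0).
alignment = {1: 0, 2: 0, 3: 3, 4: 4, 5: 1}  # surely→きっと, rain→雨, tomorrow→明日

def aligned_pairs(english, foreign, a):
    """Return (english_word, foreign_word) pairs; NULL is rendered as None."""
    return [
        (english[i - 1], foreign[a[i] - 1] if a[i] > 0 else None)
        for i in sorted(a)
    ]

for e_word, f_word in aligned_pairs(english, japanese, alignment):
    print(e_word, "→", f_word)
```

Note that because an alignment is a function on English positions, each English word aligns to exactly one foreign word (or NULL), while a single foreign word may explain several English words.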