Hubbry Logo
Matching penniesMatching penniesMain
Open search
Matching pennies
Community hub
Matching pennies
logo
8 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Matching pennies
Matching pennies
from Wikipedia
If one penny is heads and the other tails, Odd wins and keeps both coins

Matching pennies is a non-cooperative game studied in game theory. It is played between two players, Even and Odd. Each player has a penny and must secretly turn the penny to heads or tails. The players then reveal their choices simultaneously. If the pennies match (both heads or both tails), then Even wins and keeps both pennies. If the pennies do not match (one heads and one tails), then Odd wins and keeps both pennies.

Theory

[edit]
Heads Tails
Heads +1, −1 −1, +1
Tails −1, +1 +1, −1
Matching pennies

Matching Pennies is a zero-sum game because each participant's gain or loss of utility is exactly balanced by the losses or gains of the utility of the other participants. If the participants' total gains are added up and their total losses subtracted, the sum will be zero.

The game can be written in a payoff matrix (pictured right - from Even's point of view). Each cell of the matrix shows the two players' payoffs, with Even's payoffs listed first.

Matching pennies is used primarily to illustrate the concept of mixed strategies and a mixed strategy Nash equilibrium.[1]

This game has no pure strategy Nash equilibrium since there is no pure strategy (heads or tails) that is a best response to a best response. In other words, there is no pair of pure strategies such that neither player would want to switch if told what the other would do. Instead, the unique Nash equilibrium of this game is in mixed strategies: each player chooses heads or tails with equal probability.[2] In this way, each player makes the other indifferent between choosing heads or tails, so neither player has an incentive to try another strategy. The best-response functions for mixed strategies are depicted in Figure 1 below:

Figure 1. Best response correspondences for players in the matching pennies game. The leftmost mapping is for the Even player, the middle shows the mapping for the Odd player. The sole Nash equilibrium is shown in the right hand graph. x is a probability of playing heads by Odd player, y is a probability of playing heads by Even. The unique intersection is the only point where the strategy of Even is the best response to the strategy of Odd and vice versa.

When either player plays the equilibrium, everyone's expected payoff is zero.

Variants

[edit]
Heads Tails
Heads +7, -1 -1, +1
Tails -1, +1 +1, -1
Matching pennies

Varying the payoffs in the matrix can change the equilibrium point. For example, in the table shown on the right, Even has a chance to win 7 if both he and Odd play Heads. To calculate the equilibrium point in this game, note that a player playing a mixed strategy must be indifferent between his two actions (otherwise he would switch to a pure strategy). This gives us two equations:

  • For the Even player, the expected payoff when playing Heads is and when playing Tails (where is Odd's probability of playing Heads), and these must be equal, so .
  • For the Odd player, the expected payoff when playing Heads is and when playing Tails (where is Even's probability of playing Heads), and these must be equal, so .

Note that since is the Heads-probability of Odd and is the Heads-probability of Even, the change in Even's payoff affects Odd's equilibrium strategy and not Even's own equilibrium strategy. This may be unintuitive at first. The reasoning is that in equilibrium, the choices must be equally appealing. The +7 possibility for Even is very appealing relative to +1, so to maintain equilibrium, Odd's play must lower the probability of that outcome to compensate and equalize the expected values of the two choices, meaning in equilibrium Odd will play Heads less often and Tails more often.

Laboratory experiments

[edit]

Human players do not always play the equilibrium strategy. Laboratory experiments reveal several factors that make players deviate from the equilibrium strategy, especially if matching pennies is played repeatedly:

  • Humans are not good at randomizing. They may try to produce "random" sequences by switching their actions from Heads to Tails and vice versa, but they switch their actions too often (due to a gambler's fallacy). This makes it possible for expert players to predict their next actions with more than 50% chance of success. In this way, a positive expected payoff might be attainable.
  • Humans are trained to detect patterns. They try to detect patterns in the opponent's sequence, even when such patterns do not exist, and adjust their strategy accordingly.[3]
  • Humans' behavior is affected by framing effects.[4] When the Odd player is named "the misleader" and the Even player is named "the guesser", the former focuses on trying to randomize and the latter focuses on trying to detect a pattern, and this increases the chances of success of the guesser. Additionally, the fact that Even wins when there is a match gives him an advantage, since people are better at matching than at mismatching (due to the stimulus-response compatibility effect).

Moreover, when the payoff matrix is asymmetric, other factors influence human behavior even when the game is not repeated:

  • Players tend to increase the probability of playing an action which gives them a higher payoff, e.g. in the payoff matrix above, Even will tend to play more Heads. This is intuitively understandable, but it is not a Nash equilibrium: as explained above, the mixing probability of a player should depend only on the other player's payoff, not his own payoff. This deviation can be explained as a quantal response equilibrium.[5][6] In a quantal-response-equilibrium, the best-response curves are not sharp as in a standard Nash equilibrium. Rather, they change smoothly from the action whose probability is 0 to the action whose probability 1 (in other words, while in a Nash-equilibrium, a player chooses the best response with probability 1 and the worst response with probability 0, in a quantal-response-equilibrium the player chooses the best response with high probability that is smaller than 1 and the worst response with smaller probability that is higher than 0). The equilibrium point is the intersection point of the smoothed curves of the two players, which is different from the Nash-equilibrium point.
  • The own-payoff effects are mitigated by risk aversion.[7] Players tend to underestimate high gains and overestimate high losses; this moves the quantal-response curves and changes the quantal-response-equilibrium point. This apparently contradicts theoretical results regarding the irrelevance of risk-aversion in finitely-repeated zero-sum games.[8]

Real-life data

[edit]

The conclusions of laboratory experiments have been criticized on several grounds.[9][10]

  • Games in lab experiments are artificial and simplistic and do not mimic real-life behavior.
  • The payoffs in lab experiments are small, so subjects do not have much incentive to play optimally. In real life, the market may punish such irrationality and cause players to behave more rationally.
  • Subjects have other considerations besides maximizing monetary payoffs, such as to avoid looking foolish or to please the experimenter.
  • Lab experiments are short and subjects do not have sufficient time to learn the optimal strategy.

To overcome these difficulties, several authors have done statistical analyses of professional sports games. These are zero-sum games with very high payoffs, and the players have devoted their lives to become experts. Often such games are strategically similar to matching pennies:

  • In soccer penalty kicks, the kicker has two options – kick left or kick right – and the goalie has two options – jump left or jump right.[11] The kicker's probability of scoring a goal is higher when the choices do not match, and lower when the choices match. In general, the payoffs are asymmetric because each kicker has a stronger leg (usually the right leg) and his chances are better when kicking to the opposite direction (left). In a close examination of the actions of kickers and goalies, it was found[9][10] that their actions do not deviate significantly from the prediction of a Nash equilibrium.
  • In tennis serve-and-return plays, the situation is similar. It was found[12] that the win rates are consistent with the minimax hypothesis, but the players' choices are not random: even professional tennis players are not good at randomizing, and switch their actions too often.

See also

[edit]
  • Odds and evens - a game with the same strategic structure, that is played with fingers instead of coins.
  • Rock paper scissors - a similar game in which each player has three strategies instead of two.
  • Parity game - a much more complicated two-player logic game, played on a colored graph.
  • Penney's game - an exploitable sequence game

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
Matching pennies is a classic two-player, in , in which each participant simultaneously selects to display either the heads or tails side of a . The player designated as the matcher (typically Player 1) wins one unit from the other player if the choices match (both heads or both tails), while the mismatcher (Player 2) wins one unit if they differ. This simple setup results in a symmetric payoff matrix where outcomes are +1 for the winner and -1 for the loser in each scenario: heads-heads and tails-tails yield (1, -1), while heads-tails and tails-heads yield (-1, 1). The game has no pure strategy Nash equilibrium, as any predictable choice by one player can be exploited by the other to guarantee a win, highlighting the need for randomization in optimal play. Instead, the unique mixed strategy Nash equilibrium requires both players to randomize their choices equally, selecting heads or tails with 50% probability each, which yields an expected payoff of zero for both and ensures neither can unilaterally improve their outcome. This equilibrium is derived by setting the expected utilities equal for each pure strategy, making the opponent indifferent and preventing exploitation. Matching pennies serves as a foundational example for illustrating key concepts in , such as strategic interdependence, the limitations of deterministic strategies, and the role of mixed strategies in resolving games without dominant actions. It demonstrates how rational players maximize expected payoffs in purely competitive settings and has been extended to variants with asymmetric payoffs or multiple rounds to model real-world scenarios like market competition or . The game's impartial and symmetric makes it ideal for teaching the and the value of a game in zero-sum contexts.

Game Description

Basic Rules

Matching pennies is a two-player, in which one player, typically designated as the matcher (or Player 1), aims to match the choice of the other player, known as the mismatcher (or Player 2), who seeks to avoid the match. Each player simultaneously selects one of two options—heads (H) or tails (T)—by secretly positioning a or an equivalent token, without any communication between them. Upon simultaneous reveal, the matcher wins and receives +1 payoff if both players choose the same side (both H or both T), while the mismatcher wins and receives +1 payoff if the choices differ; in each case, the losing player receives -1. This structure ensures the game is strictly zero-sum, as the total payoff across both players always sums to zero. Originating as a simple parlor game long before the formal development of , matching pennies exemplifies basic simultaneous-move decision-making under uncertainty.

Payoff Matrix

The payoff structure of Matching Pennies is represented by a 2×2 bimatrix, where rows correspond to Player 1's actions (Heads or Tails), columns to Player 2's actions (Heads or Tails), and each cell contains the payoffs for both players in the form (payoff to Player 1, payoff to Player 2). This matrix quantifies the outcomes from the basic rules: Player 1 wins (+1) and Player 2 loses () if both choose Heads or both choose Tails, while Player 1 loses () and Player 2 wins (+1) otherwise. The game is zero-sum, as the payoffs in each cell sum to zero, reflecting pure where one player's gain equals the other's loss.
Player 2 \ Player 1HeadsTails
Heads(1, -1)(-1, 1)
Tails(-1, 1)(1, -1)
In its standard symmetric form, the game is impartial, with both players facing identical strategic options but opposing objectives.

Theoretical Analysis

Pure Strategies

In the matching pennies game, a pure strategy for each player consists of deterministically selecting one action—either always choosing Heads (H) or always choosing Tails (T)—without . This approach represents a fixed commitment to a single choice in every play of the game. To analyze pure strategy pairs, consider the best responses based on the game's payoff structure, where Player 1 (the matcher) receives +1 for a match and -1 for a mismatch, while Player 2 (the mismatcher) receives the opposite. If Player 1 commits to H, Player 2's optimal response is T, yielding +1 for Player 2 and -1 for Player 1. Conversely, if Player 1 commits to T, Player 2's best response is H, again resulting in +1 for Player 2 and -1 for Player 1. The same logic applies symmetrically: if Player 2 commits to H, Player 1 responds with H; if Player 2 commits to T, Player 1 responds with T. This mutual best-response dynamic reveals no stable pure strategy pair where both players' choices are simultaneously optimal against each other. Any fixed choice by one player invites exploitation by the opponent, leading to a cycle of adjustments: for instance, Player 1 choosing H prompts Player 2 to choose T, but anticipating this, Player 1 might switch to T, only for Player 2 to then switch to H. Consequently, the expected payoff under pure strategies is always disadvantageous for the player who commits first, guaranteeing -1 if the opponent responds optimally.

Mixed Strategies

In the matching pennies game, mixed strategies allow players to randomize their actions to address the instability of pure strategies. Player 1 chooses heads (H) with probability pp and tails (T) with probability 1p1 - p, while Player 2 chooses H with probability qq and T with probability 1q1 - q. The rationale for mixed strategies lies in introducing , which prevents an opponent from predictably exploiting a player's choice and ensures the opponent is indifferent between their own actions. To illustrate, the expected for Player 1 under these probabilities, assuming a payoff of +1 for a match and -1 for a mismatch, is: EU1=pq(1)+p(1q)(1)+(1p)q(1)+(1p)(1q)(1)EU_1 = p q (1) + p (1 - q) (-1) + (1 - p) q (-1) + (1 - p) (1 - q) (1) This expands and simplifies to EU1=p(2q1)+(1p)(12q)EU_1 = p(2q - 1) + (1 - p)(1 - 2q). Player 1 becomes indifferent between H and T when the expected utility of each pure strategy is equal, i.e., 2q1=12q2q - 1 = 1 - 2q, which holds at q=0.5q = 0.5. The game's symmetry implies that both players adopt identical mixing probabilities to maintain this balance of indifference.

Nash Equilibrium

In game theory, a Nash equilibrium is a strategy profile in which no player can improve their expected payoff by unilaterally deviating from their strategy, assuming all other players' strategies remain fixed. This concept, introduced by John Nash in his seminal 1951 paper on non-cooperative games, provides a foundational solution for analyzing strategic interactions like Matching Pennies. In the Matching Pennies game, there are no pure strategy Nash equilibria, as each player's best response to the other's pure strategy is to mismatch, leading to perpetual cycling. The unique Nash equilibrium is thus achieved through mixed strategies, where Player 1 chooses Heads with probability pp and Tails with probability 1p1-p, and Player 2 chooses Heads with probability qq and Tails with probability 1q1-q. To derive this equilibrium, consider Player 1's expected utility (EU) from playing Heads: EU1(H)=q1+(1q)(1)=2q1EU_1(H) = q \cdot 1 + (1-q) \cdot (-1) = 2q - 1. Similarly, EU1(T)=q(1)+(1q)1=12qEU_1(T) = q \cdot (-1) + (1-q) \cdot 1 = 1 - 2q. For Player 1 to be indifferent between Heads and Tails—ensuring no incentive to deviate—set EU1(H)=EU1(T)EU_1(H) = EU_1(T): 2q1=12q2q - 1 = 1 - 2q Solving yields 4q=24q = 2, so q=0.5q = 0.5. By symmetry, Player 2's indifference condition gives p=0.5p = 0.5. The resulting equilibrium profile is both players randomizing 50-50 over Heads and Tails, yielding an expected payoff of 0 for each player, confirming the game as fair and zero-sum. This mixed strategy equilibrium is unique due to the strictly competitive structure of Matching Pennies, where any deviation from 50-50 allows the opponent to exploit and gain a positive expected payoff.

Variants and Extensions

Asymmetric Versions

In asymmetric versions of the matching pennies game, the payoffs differ across players or action combinations, creating unequal incentives that modify the strategic dynamics from the symmetric case while maintaining the fundamental tension between matching and mismatching choices. These variants often feature non-zero-sum structures to capture behavioral effects or role-specific advantages, leading to mixed strategy equilibria where at least one player randomizes unevenly. A representative example, drawn from experimental designs in behavioral , uses the following payoff matrix (row player payoffs first, column player second):
LeftRight
Top32, 44, 8
Bottom4, 88, 4
Here, the row player benefits substantially from playing top against left (32) but faces low payoffs in other top combinations, while the bottom row offers more balanced but lower rewards. The unique mixed requires the row player to mix 50% top and 50% bottom, whereas the column player mixes 12.5% left and 87.5% right, reflecting the payoff asymmetry that favors right more heavily to keep the row player indifferent. To derive the column player's mixing probability qq (for left), set the row player's expected payoffs equal for indifference: Expected payoff for top: 32q+4(1q)=28q+432q + 4(1 - q) = 28q + 4 Expected payoff for bottom: 4q+8(1q)=84q4q + 8(1 - q) = 8 - 4q 28q+4=84q28q + 4 = 8 - 4q 32q=4    q=18=0.12532q = 4 \implies q = \frac{1}{8} = 0.125 In general, for matrices where the off-diagonal payoffs for the row player are equal (here, both 4), the column player's q=ba+bq = \frac{b}{a + b}, with aa as the top row payoff difference (324=2832 - 4 = 28) and bb as the bottom row payoff difference (84=48 - 4 = 4). This yields q=428+4=0.125q = \frac{4}{28 + 4} = 0.125, shifting from the 50-50 mix when asymmetries increase (e.g., larger aa reduces qq). The row player's uniform 50% mix stems from the symmetric incentives in the column player's payoffs. Such asymmetric formulations model real-world scenarios with inherent imbalances, such as resource disparities in economic conflicts or strategic advantages in biological predator-prey dynamics, where one agent's higher stakes in certain outcomes prompt adjusted randomization to exploit or counter opponent predictability.

Multi-Player Adaptations

In multi-player adaptations of the matching pennies game, N players simultaneously select heads (H) or tails (T), extending the two-player zero-sum conflict to group coordination where payoffs depend on the distribution of choices. A common setup is the odd-man-out variant, in which players contribute to a shared pot of fixed size x; if all choices match, the pot is split equally (x/N each). If there is a strict (e.g., k > N/2 players choose T), the majority players split the pot equally (x/k each), while minority players receive 0 and effectively lose their contribution. This structure pits individual incentives against group alignment, with the odd player(s) disadvantaged. Equilibrium analysis reveals heightened complexity compared to the two-player case, as mixed span multi-dimensional spaces but often simplify under . In symmetric zero-sum formulations, pure strategy equilibria are typically absent, since any unanimous choice (all H or all T) invites profitable deviation by a single player becoming the odd one out to capture the full pot or avoid loss. Instead, the unique symmetric involves each player randomizing with equal probability (p = 1/2 for T), rendering opponents indifferent and yielding an of zero for all players, consistent with zero-sum fairness. For non-symmetric payoff variants, equilibria require solving coupled best-response equations across N dimensions, with the game's value remaining zero in balanced cases. A representative three-player version illustrates majority matching determining the winner: each player chooses H or T, contributing to a pot normalized to 1. If all three match, payoffs are zero (no transfer). If two match and one mismatches (odd man out), the majority pair each gains 0.5, while the outlier loses 1 (paying 0.5 to each matcher). The symmetric mixed is p = 1/2 for T, with no pure equilibria, as deviation from unanimity allows the deviator to either join the or exploit as the odd one for gain. This setup highlights coordination challenges, where randomization prevents predictable matching. For larger N, computational challenges arise in deriving equilibria, particularly in asymmetric or multistage extensions, due to the exponential growth in joint action profiles (2^N possibilities) and the need to optimize high-dimensional mixed strategies via methods like linear programming or iterative best-response dynamics. While symmetric cases remain tractable with p = 1/2, perturbations (e.g., unequal contributions) demand numerical solutions, increasing time complexity to O(2^N) in worst cases without symmetry assumptions. The game's value approaches zero-sum impartiality as N grows, emphasizing fairness but amplifying solution difficulty.

Empirical Investigations

Laboratory Experiments

Laboratory experiments on the matching pennies game have primarily investigated players' ability to adhere to the mixed strategy , which requires randomizing choices with equal probability between heads and tails. These studies typically involve pairs of participants playing repeated rounds in controlled settings, with monetary incentives to encourage strategic play. Sessions often consist of 50 to 100 rounds per pair, allowing researchers to observe learning and adaptation over time. A seminal study by Ochs (1995) examined asymmetric variants of matching pennies, where payoffs differ slightly between players, leading to equilibrium mixing probabilities deviating from 50-50. Participants showed persistent over-matching behavior, favoring the action that aligned with their higher payoff more than predicted, even after 50 rounds of repetition. This deviation from suggests that humans struggle with true randomization, instead exhibiting predictable patterns influenced by payoff asymmetries. Goeree and Holt (2001) further explored information effects in matching pennies experiments, finding that players' choices are strongly affected by their own payoffs—a phenomenon termed "own-payoff effects"—resulting in mixing rates that deviate from equilibrium predictions. In their sessions with up to 100 rounds and financial stakes, subjects often employed simple heuristics like win-stay-lose-shift, repeating successful actions and switching after losses, which led to initial non-random play but gradual convergence toward near-50-50 mixing over repetitions. Average mixing proportions across these studies typically range from 45% to 55%, with learning processes improving the degree of randomness in later rounds. A 2023 study by Leng et al. revisited asymmetric matching pennies in and found participants' behavior closer to equilibrium mixing probabilities than in earlier Western samples, suggesting potential cultural or contextual influences on randomization adherence.

Real-Life Data

In real-life contexts, matching pennies manifests in informal street games and scams, such as the smack confidence trick, where two operators lure a victim into betting on matching a visible flip, often leading to rigged mismatches that exploit the victim's attempts to predict or copy outcomes. Observational data from analogous zero-sum games like rock-paper-scissors, played naturalistically via online platforms, reveal deviations from theoretical . In a of over 2.6 million matches from a application, players' choice frequencies approximated the uniform distribution (rock: 33.99%, paper: 34.82%, : 31.20%), but 18% of experienced players (with at least 100 matches) significantly deviated from equal probabilities, often showing history-dependent patterns such as increased rock selection following opponents' higher rates. These patterns suggest superstitious influences in casual play, where prior losses or opponent behaviors prompt non-random adjustments, like favoring certain choices after sequences of defeats, rather than strict equilibrium play. Specific analyses of such tournaments indicate partial , with approximately 47% of experienced players reacting strategically to historical , achieving win rates around 34.66% compared to the expected 33.33% under perfect mixing. Challenges in these uncontrolled settings include factors like player experience and platform incentives, yet recurring patterns across thousands of interactions point to gradual convergence toward mixed-strategy equilibria despite initial biases. As counterparts to controlled experiments, these naturalistic observations highlight persistent human tendencies toward predictability in high-stakes, repeated encounters.

Applications and Significance

In Economics and Decision-Making

In economics, the matching pennies models anti-coordination scenarios where one agent's optimal action requires predicting and countering the opponent's choice, leading to as a core strategy. A prominent application is in Bertrand price competition with capacity constraints, where firms producing homogeneous goods randomize prices to mismatch rivals and capture , resulting in mixed strategy equilibria that distribute prices over an interval to ensure indifference. For instance, in the Bertrand-Edgeworth model, firms mix continuously over prices above , preventing pure strategy undercutting and sustaining positive profits despite competition. The game also informs entry models with fixed costs, where potential entrants simultaneously decide whether to enter the market, randomizing their entry probabilities to make opponents indifferent and yielding equilibria with zero expected profits in duopoly settings. This structure highlights how mixed strategies create uncertainty in competitive entry decisions. Key insights from matching pennies extend to auctions, where bidders randomize to obscure valuations and avoid exploitation through predictable patterns; for example, in first-price sealed-bid auctions, mixed strategies over bid distributions ensure opponents cannot infer true , promoting fairness in allocation. In for fair allocation, variants like the matching companies game—where firms choose products to mismatch for payoffs—guide the creation of incentive-compatible rules that induce , achieving efficient social choice functions in quasilinear environments without dictatorial outcomes. Within , the game's mixed strategies illustrate how disrupts by injecting unpredictability into pricing or output decisions, as rivals' indifference prevents coordinated high prices and fosters competitive equilibria. In modern , post-2010 analyses model high-frequency traders as playing repeated anti-coordination games, randomizing order placements and timings to evade front-running or exploitation by opponents' predictive algorithms, with equilibria balancing liquidity provision against risks. Empirical support from validates these models, showing randomization correlates with reduced predictability in trade flows.

In Psychology and Neuroscience

In , the matching pennies game has been employed to investigate automatic and its conflict with strategic incentives to avoid . A seminal study demonstrated that players exhibit unintended of opponents' gestures despite strong monetary incentives to mismatch, suggesting an automatic process rooted in social learning mechanisms that operates independently of conscious strategic intent. This avoidance highlights how cognitive biases toward social can undermine optimal decision-making in competitive interactions. Neuroscience research using (fMRI) has revealed (PFC) involvement in generating mixed strategies during matching pennies, particularly in resolving and updating choices based on . Human fMRI findings show medial PFC activation during mentalizing opponents' intentions in the game, correlating with strategic depth and expectations of opponent behavior amid . These activations underscore the PFC's role in balancing predictability and in . Behavioral investigations reveal cultural variations in randomization tendencies during matching pennies. A 2023 study in found that players in asymmetric versions produced mixing rates closer to equilibrium predictions than earlier Western samples, suggesting contextual or cultural factors influence deviation from rational play and highlighting the need for validation of behavioral anomalies. Behavioral studies using matching pennies reveal that impulsive traits in predict differences in choice behavior and reward rates during play.

References

Add your contribution
Related Hubs
User Avatar
No comments yet.