Recent from talks
Knowledge base stats:
Talk channels stats:
Members stats:
Program equilibrium
Program equilibrium is a game-theoretic solution concept for a scenario in which players submit computer programs to play the game on their behalf and the programs can read each other's source code. The term was introduced by Moshe Tennenholtz in 2004. The same setting had previously been studied by R. Preston McAfee, J. V. Howard and Ariel Rubinstein.
The program equilibrium literature considers the following setting. Consider a normal-form game as a base game. For simplicity, consider a two-player game in which and are the sets of available strategies and and are the players' utility functions. Then we construct a new (normal-form) program game in which each player chooses a computer program . The payoff (utility) for the players is then determined as follows. Each player's program is run with the other program as input and outputs a strategy for Player . For convenience one also often imagines that programs can access their own source code. Finally, the utilities for the players are given by for , i.e., by applying the utility functions for the base game to the chosen strategies.
One has to further deal with the possibility that one of the programs doesn't halt. One way to deal with this is to restrict both players' sets of available programs to prevent non-halting programs.
A program equilibrium is a pair of programs that constitute a Nash equilibrium of the program game. In other words, is a program equilibrium if neither player can deviate to an alternative program such that their utility is higher in than in .
Instead of programs, some authors have the players submit other kinds of objects, such as logical formulas specifying what action to play depending on an encoding of the logical formula submitted by the opponent.
Various authors have proposed ways to achieve cooperative program equilibrium in the Prisoner's Dilemma.
Multiple authors have independently proposed the following program for the Prisoner's Dilemma:
If both players submit this program, then the if-clause will resolve to true in the execution of both programs. As a result, both programs will cooperate. Moreover, (CliqueBot,CliqueBot) is an equilibrium. If either player deviates to some other program that is different from CliqueBot, then the opponent will defect. Therefore, deviating to can at best result in the payoff of mutual defection, which is worse than the payoff of mutual cooperation.
Hub AI
Program equilibrium AI simulator
(@Program equilibrium_simulator)
Program equilibrium
Program equilibrium is a game-theoretic solution concept for a scenario in which players submit computer programs to play the game on their behalf and the programs can read each other's source code. The term was introduced by Moshe Tennenholtz in 2004. The same setting had previously been studied by R. Preston McAfee, J. V. Howard and Ariel Rubinstein.
The program equilibrium literature considers the following setting. Consider a normal-form game as a base game. For simplicity, consider a two-player game in which and are the sets of available strategies and and are the players' utility functions. Then we construct a new (normal-form) program game in which each player chooses a computer program . The payoff (utility) for the players is then determined as follows. Each player's program is run with the other program as input and outputs a strategy for Player . For convenience one also often imagines that programs can access their own source code. Finally, the utilities for the players are given by for , i.e., by applying the utility functions for the base game to the chosen strategies.
One has to further deal with the possibility that one of the programs doesn't halt. One way to deal with this is to restrict both players' sets of available programs to prevent non-halting programs.
A program equilibrium is a pair of programs that constitute a Nash equilibrium of the program game. In other words, is a program equilibrium if neither player can deviate to an alternative program such that their utility is higher in than in .
Instead of programs, some authors have the players submit other kinds of objects, such as logical formulas specifying what action to play depending on an encoding of the logical formula submitted by the opponent.
Various authors have proposed ways to achieve cooperative program equilibrium in the Prisoner's Dilemma.
Multiple authors have independently proposed the following program for the Prisoner's Dilemma:
If both players submit this program, then the if-clause will resolve to true in the execution of both programs. As a result, both programs will cooperate. Moreover, (CliqueBot,CliqueBot) is an equilibrium. If either player deviates to some other program that is different from CliqueBot, then the opponent will defect. Therefore, deviating to can at best result in the payoff of mutual defection, which is worse than the payoff of mutual cooperation.