Modified: September 12, 2023
cooperative game
This page is from my personal notes, and has not been specifically reviewed for public consumption. It might be incomplete, wrong, outdated, or stupid. Caveat lector.References:
- https://en.wikipedia.org/wiki/Cooperative_game_theory
- https://en.wikipedia.org/wiki/Shapley_value
- conversations with GPT-4
A cooperative game consists of a set of players, called the grand coalition, and a characteristic function , which specifies the payoff achieved by cooperation among any subset of the players. The characteristic function satisfies , and is usually assumed to be superadditive, meaning that the union of disjoint coalitions has value at least as great as those coalitions individually: for when .
Superadditivity can be thought of as the assumption that cooperation is generally beneficial, perhaps due to economies of scale or synergies between players. It implies that the greatest possible value is realized when the set of all players (the grand coalition) works together.
I think that the mental model for thinking about cooperative game theory is that cooperation is the only possible interaction between the players: in the absence of a coalition, the players can all revert to collecting their individual payoffs. Under this assumption, superadditivity is the only incentive to cooperation, because plain old 'additive' payoffs are still available without cooperation. This would not be true if the underlying world in the absence of competition is actually competitive: if failing to form a coalition leaves the players in the situation of questing after some rivalrous good, so that the individual payoffs are not additive but perhaps even mutually exclusive.
A stronger condition than superadditivity is supermodularity: where the reward to joining a coalition increase with the size of the coalition, so that cooperation tends to 'snowball': for all . A game with a supermodular characteristic function is called a convex game.
Solution concepts
In cooperative game theory the main task is to allocate the payoff for the grand coalition . Most solution concepts begin with the notation of an imputation. This is any vector allocating payoffs to individual players that satisfies the basic properties of
- efficiency: summing to the total payoff, , and
- individual rationality: giving each player at least as much as they would get on their own, .
Stable solutions and the core
We would ideally like a solution to have some notion of stability. A basic notion is that no subset of players can do better by breaking off on their own. The set of imputations satisfying this property is called the core. Formally, an imputation is in the core if for all subsets . We can think of the core as the set of all imputations under which players are incentivized to form the grand coalition.
Some properties of the core:
- It may be empty. For example, if there are three workers and a heavy load that requires only two workers to carry, then any two of the workers can split off and get the whole payoff to themselves. So there is no stable way to allocate payoff to all three workers.
- We can of course just give worker 3 a payoff of zero, effectively cutting them out of the coalition: is a valid imputation. But is not stable, since worker 3 could counter-offer with something like to form the coalition cutting out worker 1 , which worker 2 would accept. And so on. There would be a stable solution if we had one fewer worker, but there is no stable way to decide which worker to cut out.
- It is defined by a set of linear inequalities on payoff vectors, so it is always closed and convex. And it can be solved for as the solution to a set of linear inequalities (i.e., a linear programming problem with a constant objective).
- The Bondareva-Shapley theorem says that the core exists if and only if a game is balanced in a particular sense. I don't really understand if this is supposed to provide any extra intuition, or just add technical difficulty.
Shapley values and fair allocation
The core is a set of imputations which are stable, but as a concept it doesn't tell us which of these imputations bargainers will or 'should' converge on.
Some other properties we might hope for in a solution include:
- symmetry: players that provide identical value should receive identical value. Formally, if adding player to a coalition always leads to the same value as adding player (for all coalitions not already containing one of those players), then we should distribute payoffs equally, .
- null player: a player that never adds value (formally, if for all coalitions ) should receive nothing.
- linearity: combining two separate games described by and into a new game with payoffs should lead to a distribution equal to the sum of the original distributions.
The Shapley value is the unique payoff rule that satisfies these properties (as well as various others). It is defined as the average value that a player adds when joining the coalition, taken over all possible orders (permutations of players) in which the grand coalition can be formed:
where is the set of predecessors of in the permutation (for example, if and , then are the players that precede player 4 in this permutation).
Note that the Shapley value can be outside the core. In the example above where three workers are doing a task that only requires two workers, symmetry of the workers implies that the Shapley value is , even though this is not a stable distribution (and indeed, no stable distribution is possible).