Modified: November 16, 2022
kelly criterion
This page is from my personal notes, and has not been specifically reviewed for public consumption. It might be incomplete, wrong, outdated, or stupid. Caveat lector.We are given the opportunity to bet some fraction of our wealth on a coin flip with probability . We can repeat this as many times as we like. How much should we bet?
This seems like a tricky question because it has the form of a sequential decision problem: we can imagine a branching tree of decisions for how much to bet at each step, depending on how the previous bets have gone. Just as finding the optimal first move in chess would require solving the entire game, in the worst case determining the optimal first bet might require gaming out its potential repercussions into the infinite future.
The problem is also underspecified: the answer will depend on what our utility function is. However there are (at least) two natural choices of utility function for which the sequential decision problem simplifies dramatically: linear and logarithmic utility.
Formally, let be a random variable for our wealth after steps, and represent the coinflip at step . If at each step we bet fraction of our current wealth, then our wealth will be multiplied by , that is, either or depending on the outcome of the coinflip. Then our wealth after steps will grow as
In the case of linear utility, we try to maximize
where we can move the expectation inside the product because each step is independent of the others. This shows that we can simply maximize independently at each step, which simply means taking (betting the entire bankroll) at all steps assuming that the 's have positive expectation. This is a counterintuitive strategy, since in the vast majority of cases we will lose everything, but those losses are offset by the enormous compounding of wealth in the world where we win all bets (which happens with probability ).
The other (and debatably more natural) assumption is that of logarithmic utility, where we try to maximize
Taking logs converts compounding into a simple sum, where linearity of expectation works so that we need only consider the expectation at a single step, i.e., we need only maximize the expected growth rate
This can be solved by straightforward calculus, setting the derivative with respect to equal to zero,
and solving for the optimal bet (the Kelly criterion),
Let's plug in numbers for intuition. If , the Kelly criterion says that we should bet of our wealth at each round. On the other hand, if , we should bet of our wealth.
Alternative arguments for the Kelly criterion
There are some alterative goals that also lead to the Kelly criterion: for example, maximizing wealth in the median outcome. See: