Part 1: Starting from Zero
Part 2: Meet Eustace
Part 3: Bernoulli Trials
Part 4: Why Randomness is Not All Equally Random
Have you ever been told what to do with your life? A particular college, major, grad school, career? Isn’t it annoying, that someone would presume to plot out your life for you, as if you had no say in the matter? Probability theory has a term for this (the plotting, not the annoyance): strong solution.
A strong solution is any specified trajectory for a random process. In our coin flipping game it would be the realized sequence of heads and tails. Of course Eustace can’t know such a path in advance. The best he can do is to construct a distribution of possible outcomes. This distribution is a weak solution, which is defined not by its path, which is unknown, but only by the moments of a probability distribution. If Eustace knows a random process is stationary, he can be confident that its moments will converge to the same values every time. The coin flipping game, for instance, is stationary: given a fair coin, its long-term average winnings per flip will always converge to zero. Looking into an uncertain future, Eustace is always limited to a weak solution, specified by the expectations, or moments, of the underlying random process. The actual path remains a mystery.
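To make the distinction concrete, here is a minimal Python sketch. It assumes a fair coin and a $1 stake per flip; the function name play, the seed, and the choice of 10 flips and 20,000 games are mine, picked purely for illustration. A single call yields one strong solution, a realized path. Repeating the game many times recovers only the moments, which is all a weak solution offers.

```python
import random
from statistics import mean, pvariance

def play(n_flips, rng):
    """Return one realized path of cumulative winnings (+1 heads, -1 tails)."""
    wealth, path = 0, []
    for _ in range(n_flips):
        wealth += 1 if rng.random() < 0.5 else -1  # assumed fair coin, $1 stake
        path.append(wealth)
    return path

rng = random.Random(0)  # arbitrary fixed seed so the run is repeatable

# One strong solution: a single realized trajectory.
one_path = play(10, rng)

# A weak solution: only the moments of the final winnings over many games.
finals = [play(10, rng)[-1] for _ in range(20000)]

print("one realized path:", one_path)
print("mean of final winnings:", round(mean(finals), 2))
print("variance of final winnings:", round(pvariance(finals), 2))
```

The mean should land near 0 and the variance near 10 (one unit per flip), while the printed path is just one of 1,024 equally likely trajectories.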
So far we haven’t given poor Eustace much help. A weak “solution” is fine for mathematics; but being a mere cloud of possibilities, it is, from Eustace’s point of view, no solution at all. (A Eustace entranced by the weak solution is what we commonly call a perfectionist.) Sooner rather than later, he must risk a strong solution. He must chart a course: he must act.
Well then, on what basis? Probability theory has a term for this too. The accumulated information on which Eustace can base his course is called a filtration. A filtration represents all information available to Eustace when he chooses a course of action. Technically, it is an increasing family of sigma algebras: each one collects the events Eustace can distinguish at a given time, and each contains all the ones before it, so information is never lost. The more of the available filtration Eustace uses, the better he does in the casino.
In the coin flipping game, Eustace’s filtration includes, most obviously, the record of the previous flips. Of course in this case the filtration doesn’t help him predict the next flip, but it does help him predict his overall wins and losses. If Eustace wins the first flip (t=1), he knows that after the next flip (t=2), he can’t be negative. This is more information than he had when he started (t=0). If the coin is fair, Eustace has an equal likelihood of winning or losing $1. Therefore, the expected value of his wealth at any point is simply what he has won up to that point. The past reveals what Eustace has won. The future of this stationary distribution is defined by unchanging moments. In a fair game, Eustace can expect to make no money no matter how lucky he feels.
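The effect of the filtration on what Eustace can expect is small enough to check by brute force. The sketch below assumes a fair coin and $1 stakes, and simply enumerates every equally likely continuation of a known history; the helper name conditional_wealth and its return format are illustrative choices of mine.

```python
from fractions import Fraction
from itertools import product

def conditional_wealth(history, horizon):
    """Expected, worst and best wealth after `horizon` flips, given the
    flips already observed in `history` (+1 for a win, -1 for a loss)."""
    wealth_now = sum(history)
    remaining = horizon - len(history)
    # Every continuation of a fair coin is equally likely.
    finals = [wealth_now + sum(future)
              for future in product((+1, -1), repeat=remaining)]
    expected = Fraction(sum(finals), len(finals))
    return expected, min(finals), max(finals)

at_t0 = conditional_wealth((), 2)      # nothing observed yet
at_t1 = conditional_wealth((+1,), 2)   # first flip won

print("t=0:", at_t0)
print("t=1:", at_t1)
```

Knowing that the first flip was won raises the expected wealth at t=2 from 0 to 1 and lifts the worst case from -2 to 0. The next flip itself is no more predictable than before.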
His filtration also includes, less obviously, the constraints of the game itself. Recall that if he wins $100 he moves to a better game and if he loses $100 he’s out in the street. To succeed he must eliminate paths that violate known constraints; a path to riches, for instance, that requires the casino to offer an unlimited line of credit is more likely a path to the poorhouse.
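A rough simulation shows what these constraints do to the paths. This is only a sketch under stated assumptions: a fair coin, $1 stakes, and barriers scaled down from $100 to $10 so that it runs quickly. The function name play_until_barrier, the seed, and the number of games are mine.

```python
import random

def play_until_barrier(start, lower, upper, rng):
    """Flip a fair coin for $1 stakes until wealth hits `lower` or `upper`."""
    wealth, flips = start, 0
    while lower < wealth < upper:
        wealth += 1 if rng.random() < 0.5 else -1
        flips += 1
    return wealth, flips

rng = random.Random(1)  # arbitrary fixed seed for repeatability

# Barriers at -10 and +10 stand in for the -$100 and +$100 of the text.
games = [play_until_barrier(0, -10, 10, rng) for _ in range(5000)]

promoted = sum(1 for wealth, _ in games if wealth == 10) / len(games)
avg_flips = sum(flips for _, flips in games) / len(games)

print("fraction of games ending at the upper barrier:", promoted)
print("average flips before the game ends:", avg_flips)
```

For a fair coin the classical gambler's ruin result gives a probability of (start - lower) / (upper - lower) of reaching the upper barrier first, here one half, and an expected game length of (start - lower) * (upper - start) flips, here 100. The simulated figures should sit close to both. Every path ends at a barrier; none is allowed to wander past it on credit.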
We can summarize all of this with two simple equations:
E(wealth@t | F@t-1) = wealth@t-1 (first moment)
variance(wealth@t | F@t-1) = 1 (second moment)
The expected wealth at any time t is simply the wealth Eustace has accumulated up until time t-1. E is expected value. t is commonly interpreted as a time index. More generally, it is an index that corresponds to the size of the filtration, F. F accumulates the set of distinguishable events in the realized history of a random process. In our coin game, the outcome of each flip adds information to Eustace’s filtration.
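Both moments follow from one line of arithmetic each, and exact fractions make that easy to check. In the sketch below the function name one_step_moments and its parameters p_heads and stake are illustrative; the defaults correspond to the fair coin and $1 stake assumed in the text.

```python
from fractions import Fraction

def one_step_moments(wealth_prev, p_heads=Fraction(1, 2), stake=1):
    """Conditional mean and variance of wealth@t given wealth@t-1."""
    outcomes = [
        (wealth_prev + stake, p_heads),      # win the flip
        (wealth_prev - stake, 1 - p_heads),  # lose the flip
    ]
    mean = sum(w * p for w, p in outcomes)
    variance = sum(p * (w - mean) ** 2 for w, p in outcomes)
    return mean, variance

fair = one_step_moments(7)
print("fair coin, wealth@t-1 = 7:", fair)
```

With a fair coin the mean is wealth@t-1 itself and the variance is 1, whatever the starting wealth. Passing a biased p_heads, say Fraction(3, 5), shifts the mean away from wealth@t-1 and shrinks the variance below 1, which is one way to see that both equations depend on the coin being fair.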
We have also assumed that when Eustace’s wealth reaches zero he must stop playing. Game over. There is always a termination point, though it need not always be zero; maybe Eustace needs to save a few bucks for the bus ride home. Let’s give this point a name; call it wealthc (critical). Introducing this term into our original equation for expected wealth, Eustace’s objective becomes:
max E(wealth@t - wealthc | F@t-1)
His thermodynamic environment works the same way. In the casino, Eustace can’t blindly apply any particular strong solution — an a priori fixed recipe for a particular sequence of hits and stands at the blackjack table. Each card dealt in each hand will, or should, influence his subsequent actions in accordance with the content of his filtration. The best strategy is always the one with max E(wealth@t|F@t-1) at each turn. In this case, F@t-1 represents the history of dealt cards.
As Eustace graduates to higher levels of the casino, the games become more complex. Eustace needs some way of accommodating histories: inflexibility is a certain path to ruin. Card counters differ from suckers at blackjack only by employing a more comprehensive model that adapts to the available filtration. They act on more information: the history of the cards dealt, the number of decks in the shoe, the number of cards that are played before a reshuffle. By utilizing all the information in their filtration, card counters can apply the optimal strong solution every step of the way.
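As a small illustration of acting on the filtration, the sketch below computes the exact probability that the next card is worth ten, given the cards already dealt. It is not a counting system, only the conditional probability such systems approximate. It assumes standard 52-card decks and no reshuffle so far; the names RANKS, remaining_shoe and prob_ten_value are mine.

```python
from collections import Counter
from fractions import Fraction

RANKS = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
TEN_VALUED = ("10", "J", "Q", "K")

def remaining_shoe(n_decks, dealt):
    """Count the cards of each rank still in the shoe."""
    shoe = Counter({rank: 4 * n_decks for rank in RANKS})
    shoe.subtract(Counter(dealt))
    if any(count < 0 for count in shoe.values()):
        raise ValueError("dealt cards are not consistent with the shoe")
    return shoe

def prob_ten_value(n_decks, dealt=()):
    """P(next card is worth ten | cards dealt so far)."""
    shoe = remaining_shoe(n_decks, dealt)
    tens = sum(shoe[rank] for rank in TEN_VALUED)
    return Fraction(tens, sum(shoe.values()))

fresh = prob_ten_value(1)
after_low_cards = prob_ten_value(1, ["2", "3", "4", "5", "6"])

print("fresh single deck:", fresh)
print("after five low cards:", after_low_cards)
```

With a fresh single deck the probability is 4/13. After five low cards have gone by it rises to 16/47, and a player who ignores the history is still betting as if it were 4/13.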
In the alpha casino, Eustace encounters myriad random processes. His ability to mediate the effects of these interactions is a direct consequence of the configuration of his alpha model. The best he can hope to do is accommodate as much of the filtration into this model as he can to generate the best possible response. Suboptimal responses will result in smaller gains or greater losses of alpha. In Part 6 we will take up what one of my readers called the policy implications of all this.
Disclaimer: Although I use the language of agency — to know, to act, to look into the future — nothing in this discussion is intended to impute agency, or consciousness, or even life, to Eustace. One could speak of any inanimate object with a feedback mechanism — a thermostat, a coffeemaker — in exactly the same way. Unfortunately English does not permit discussing these matters in any other terms. Which is why I sometimes want to run shrieking back to my equations. You may feel otherwise.