N agents each round pick side 0 or 1; the minority side wins. Each agent has S strategies mapping the last m bits of history to a side. Reinforce the strategy that would have been correct.