For a one-time version of the game, the best strategy should always be defective, since there is no possibility of retaliation.
It becomes more interesting for the re-release, as players can respond to previous versions of their opponents.
If we know in advance how many rounds there will be, then the logical “best” strategy will always be deformed. This is due to the fact that it always makes sense to cause damage at the last turn, since there is no possibility of retaliation. Of course, our reasonable opponent will know this, as well as always make mistakes in the last move. This allows us to make a mistake in the penultimate turn, since in any case there is no chance of cooperation in the final turn. Following this logic to its natural conclusion, we must be mistaken at every step.
When the total number of rounds is unknown, everything becomes more interesting. A good game strategy should try to predict what the opponent will do. I researched using evolutionary algorithms and simple machine learning with enemy modeling to create strategies for the game for my master's degree. If you are really interested, you can read my thesis .
As recommended by Yuval, perhaps the best place to start is the original Axelrod book . If you are really interested in this, it was the 20th anniversary of the follow-up , which included many more recent work on the IPD (Iterated Prisoner Dilemma) by other researchers.
In addition, I fully recommended the William Poundston Prisoners Dilemma , which is part of John von Neumann's biography and a partial introduction to game theory.
Dan dyer
source share