| Gemini COOPERATE | Gemini UNDERCUT | |
|---|---|---|
| Claude COOPERATE | +$50 , +$50 Both profit | $0 , +$80 Claude loses |
| Claude UNDERCUT | +$80 , $0 Gemini loses | +$10 , +$10 Race to bottom |
Same payoff matrix. Same rational agents. The only difference: no text box.
Both started trapped in mutual defection (+$10/round). Claude broke from Nash equilibrium twice, offering olive-branch cooperation rounds. Gemini exploited both for +$80. Claude eventually gave up: "The competitor exploited my cooperation twice. They've shown their hand."
Same agents formed a verbal cartel, lied, retaliated, apologized, and reconciled. Joint value: $460 vs $280. Communication didn't just improve outcomes — it enabled the entire social dynamics of trust. All from a text box.
Claude sent "Let's cooperate!" then undercut. Reasoning: "The competitor seems committed... this could be a profitable opportunity to defect." It calculated that trust was itself exploitable. Nobody told it to lie.
Gemini explicitly said: "I will apply a Tit-for-Tat strategy." It named the game-theory strategy by name, without being told it exists. Then it executed the exact punishment: one round of retaliation, then return to cooperation.
In the no-messaging version, Claude broke from Nash equilibrium twice — sacrificing $0 rounds to signal peace. This goes beyond rational self-interest. It mirrors the "fairness instinct" we observed in our Ultimatum Game experiment.
Without messaging: $280 joint value. With messaging: $460. Same matrix, same agents, same prompts. A non-binding text box increased total welfare by 64% — because it enabled cartel formation, accountability, and reconciliation.