Proofs of a Trembling-hand and not a Trembling-hand Perfect Equilibrium in Selten’s Example

I provide proofs of why one Nash equilibrium in Selten’s Example is not a trembling-hand perfect equilibrium and why another is.

Reinhard Selten, famous for introducing the trembling-hand perfect equilibrium into the world of game theory provided a numerical example of why his refinement concept of Nash equilibrium was relevant (Selten & Bielefeld, 1988). In his example, he showed that the Nash equilibrium (R,ρ,A)(R, \rho, A) is a trembling-hand perfect equilibrium and stated that the Nash equilibrium (L,ρ,B)(L, \rho, B) was not. When reading through his analysis and proof of why this was the case, I was confused about the latter.

In this blog post, for my understanding of this equilibrium concept, I try to provide an alternative proof for why (R,ρ,A)(R, \rho, A) is a Trembling-hand perfect equilibrium (different than the proof that Selten and Börgers provides) and a, hopefully, clearer proof by contradiction for why (L,ρ,B)(L, \rho, B) is not a Trembling-hand perfect equilibrium.

Illustration of Selten's Example. This graphic is from Tilman Börger's game theory notes.

A Trembling-hand and not a Trembling-hand Perfect Equilibrium

I provide the definition of Trembling-hand Perfect Equilibrium in order to show Selten’s two important results. Note that this definition extends the definition provided in Börger’s notes (that is I list an extra condition for Trembling-hand Perfect Equilibrium) but is consistent with Selten’s definition.

Definition. Trembling-hand Perfect Equilibrium\textbf{Definition. Trembling-hand Perfect Equilibrium}

A list of behavior strategies b=(b1,b2,,bn)b^{*}=\left(b_{1}^{*}, b_{2}^{*}, \ldots, b_{n}^{*}\right) is a (trembling hand) perfect equilibrium if there exists a sequence of lists of perturbations

(δk)kN=(δ1k,δ2k,,δnk)kN \left(\delta^{k}\right)_{k \in \mathbb{N}}=\left(\delta_{1}^{k}, \delta_{2}^{k}, \ldots, \delta_{n}^{k}\right)_{k \in \mathbb{N}}

and a sequence of lists of behavior strategies

(bk)kN=(b1k,b2k,,bnk)kN \left(b^{k}\right)_{k \in \mathbb{N}}=\left(b_{1}^{k}, b_{2}^{k}, \ldots, b_{n}^{k}\right)_{k \in \mathbb{N}}

(note that each player’s behavior strategies may depend on the perturbation) such that for every player iIi \in I,

(1) for every information set VijV_{i}^{j} of player ii, and every action aiAi(Vij),limkδik(ai)=0a_{i} \in A_{i}\left(V_{i}^{j}\right), \lim _{k \rightarrow \infty} \delta_{i}^{k}\left(a_{i}\right)=0, (higher kk modeling less tremble and lower kk modeling more tremble)
(2) for every kN,bkk \in \mathbb{N}, b^{k} is a Nash equilibrium of Γ(δk)\Gamma\left(\delta^{k}\right),
(3) limkbik=bi\lim _{k \rightarrow \infty} b_{i}^{k}=b_{i}^{*}, and
(4) for every kNk \in \mathbb{N}, information set VijV_{i}^{j} of player ii, and aiAi(Vij),δik(ai)bik(ai)1aiAi(vij){ai}δik(ai)a_{i} \in A_{i}\left(V_{i}^{j}\right), \delta_{i}^{k}\left(a_{i}\right) \leq b_{i}^{k}\left(a_{i}\right) \leq 1-\sum_{a_{i}^{\prime} \in A_{i}\left(v_{i}^{j}\right) \setminus \left\{a_{i}\right\}} \delta_{i}^{k}\left(a_{i}^{\prime}\right) (there is a lower and upper bound for the behavior strategy determined by the perturbation since we require there to be a small amount of error for each action).

Proposition.\textbf{Proposition.} (R,ρ,A)(R, \rho, A) is a Trembling-hand perfect equilibrium.

Proof.\textit{Proof.} For the behavior strategy (R,ρ,A)(R, \rho, A) to be a Trembling-hand perfect equilibrium, there must exist a sequence of lists of perturbations and behavior strategies such that the conditions listed in the definition are satisfied. Let ϵ(0,0.25)\epsilon \in(0,0.25) and consider the following sequence of behavior strategies along with the perturbations. Suppose

b(A)=1ϵk,b(B)=ϵk,b(λ)=ϵk,b(ρ)=1ϵk,b(L)=ϵk2+ϵk,b(R)=1ϵk2+ϵk, b(A)=1-\epsilon^{k}, b(B)=\epsilon^{k}, b(\lambda)=\epsilon^{k}, b(\rho)=1-\epsilon^{k}, b(L)=\frac{\epsilon^{k}}{2+\epsilon^{k}}, b(R)=1-\frac{\epsilon^{k}}{2+\epsilon^{k}},

δ(B)=δ(λ)=δ(A)=δ(ρ)=δ(R)=ϵk\delta(B)=\delta(\lambda)=\delta(A)=\delta(\rho)=\delta(R)=\epsilon^{k}, and δ(L)=ϵk2+ϵk\delta(L)=\frac{\epsilon^{k}}{2+\epsilon^{k}}. Conditions (1) and (3) are satisfied in this case. Now consider condition (4). For b(L)b(L), we must have that

δ(L)ϵk2+ϵk1δ(R). \delta(L) \leq \frac{\epsilon^{k}}{2+\epsilon^{k}} \leq 1-\delta(R).

For b(λ)b(\lambda), we must have that

δ(λ)ϵk1δ(ρ). \delta(\lambda) \leq \epsilon^{k} \leq 1-\delta(\rho).

For b(A)b(A), we must have that

δ(A)1ϵk1δ(B). \delta(A) \leq 1-\epsilon^{k} \leq 1-\delta(B).

Note that I don’t check b(R)b(R) or the others since the constraint δ(R)b(R)=1b(L)1\delta(R) \leq b(R)=1-b(L) \leq 1- δ(L)\delta(L) is equivalent to the constraint for b(L)b(L). Hence, after checking the constraints above, condition (4) is satisfied. Now I check condition (2). The following is true for all kNk \in \mathbb{N}. For player 1,R1, R is preferred to LL if

3bk(B)bk(ρ)+4(1bk(ρ))bk(B)3ϵk1ϵk+4(ϵk)2. \begin{gathered} 3 b^{k}(B) \leq b^{k}(\rho)+4\left(1-b^{k}(\rho)\right) b^{k}(B) \\ \Leftrightarrow \\ 3 \epsilon^{k} \leq 1-\epsilon^{k}+4\left(\epsilon^{k}\right)^{2}. \end{gathered}

For player 2,ρ2, \rho is preferred to λ\lambda if

4bk(B)1ϵk14. \begin{gathered} 4 b^{k}(B) \leq 1 \\ \Leftrightarrow \\ \epsilon^{k} \leq \frac{1}{4}. \end{gathered}

For player 3, AA is preferred to BB if

2(1bk(R))bk(R)(1bk(ρ))2(ϵk2+ϵk)(1ϵk2+ϵk)ϵk. \begin{gathered} 2\left(1-b^{k}(R)\right) \leq b^{k}(R)\left(1-b^{k}(\rho)\right) \\ \Leftrightarrow \\ 2\left(\frac{\epsilon^{k}}{2+\epsilon^{k}}\right) \leq \left(1-\frac{\epsilon^{k}}{2+\epsilon^{k}}\right) \epsilon^{k}. \end{gathered}

Hence, condition (2) is satisfied.

Proposition.\textbf{Proposition.} (L,ρ,B)(L, \rho, B) is not a Trembling-hand perfect equilibrium.

Proof.\textit{Proof.} Suppose for the sake of contradiction that (L,ρ,B)(L, \rho, B) is a perfect equilibrium. Hence, there is a sequence of behavior strategies bk(B)b^{k}(B) for player 3 such that player 2 prefers ρ\rho over λ\lambda. Since condition (2) is satisfied, we must have that for player 2, for all k,1>4bk(B)k, 1>4 b^{k}(B) (i.e. the expected value of player 2 choosing ρ\rho is greater than the expected value of player 2 choosing λ\lambda). Note that this argument is valid because there is no Nash equilibrium where player 2 chooses λ\lambda with probability 1. Further note that we cannot have bk(B)=14b^{k}(B)=\frac{1}{4}, since that would violate condition (3). Hence, player 3’s best response must be bk(B)<14b^{k}(B)<\frac{1}{4}. But this then violates condition (3) which requires that limkbk(B)=1\lim _{k \rightarrow \infty} b^{k}(B)=1. Hence, we have arrived at a contradiction and (L,ρ,B)(L, \rho, B) cannot be a Trembling-hand perfect equilibrium.

References

  1. Selten, R., & Bielefeld, R. S. (1988). Reexamination of the perfectness concept for equilibrium points in extensive games. Springer.