Learning in Two-Player Matrix Games

2024-08-14 17:37:12

3.2 Nash Equilibria in Two-Player Matrix Games

For a two-player matrix game, we can set up a matrix in which each element contains the reward for one joint action pair. The reward function $R_i$ for player $i$ ($i = 1, 2$) then becomes a matrix.

A two-player matrix game is called a zero-sum game if the two players are fully competitive. In this case, we have $R_1 = -R_2$. A zero-sum game has a unique Nash equilibrium (NE) in the sense of the expected reward. This means that, although each player may have multiple NE strategies in a zero-sum game, the value of the expected reward under these NE strategies is the same. A general-sum matrix game refers to all types of matrix games. In a general-sum matrix game, the NE is no longer unique and the game might have multiple NEs.
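The zero-sum condition can be checked entry by entry. The following is a minimal sketch, assuming the reward matrices are stored as nested Python lists; the helper name `is_zero_sum` and the matching-pennies payoffs are illustrative choices, not taken from the text.

```python
def is_zero_sum(R1, R2, tol=1e-12):
    """Return True if R1[l][f] + R2[l][f] is zero for every joint action (l, f)."""
    return all(
        abs(r + c) <= tol
        for row1, row2 in zip(R1, R2)
        for r, c in zip(row1, row2)
    )


# Matching pennies (illustrative payoffs): the row player wins when the coins match.
R1 = [[1, -1], [-1, 1]]
R2 = [[-1, 1], [1, -1]]

print(is_zero_sum(R1, R2))
```

Any pair of matrices that fails this check is a general-sum game that is not zero-sum.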

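For a two-player matrix game, we define $PD(A_i)$ as the set of all probability distributions over player $i$'s action set $A_i$, so that a strategy for player $i$ is a distribution $\pi_i \in PD(A_i)$. Treating $\pi_1$ and $\pi_2$ as row vectors, the expected reward $V_i$ of player $i$ then becomes

$V_i = \pi_1 R_i \pi_2^T$ (1)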
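The expected reward is a bilinear form in the two strategies: each entry of the reward matrix is weighted by the probability of the corresponding joint action. The sketch below writes this out as a double sum, assuming strategies are lists of probabilities and rewards are nested lists; the function name `expected_reward` and the payoffs are hypothetical.

```python
def expected_reward(pi1, R, pi2):
    """Compute pi1 * R * pi2^T for row strategy pi1 and column strategy pi2."""
    return sum(
        pi1[l] * R[l][f] * pi2[f]
        for l in range(len(pi1))
        for f in range(len(pi2))
    )


# Matching pennies (illustrative payoffs), both players mixing uniformly.
R1 = [[1, -1], [-1, 1]]
R2 = [[-1, 1], [1, -1]]
pi1 = [0.5, 0.5]
pi2 = [0.5, 0.5]

V1 = expected_reward(pi1, R1, pi2)  # expected reward of the row player
V2 = expected_reward(pi1, R2, pi2)  # expected reward of the column player
print(V1, V2)
```

With pure strategies, the sum collapses to a single matrix entry: `expected_reward([1, 0], R1, [0, 1])` simply reads off the first-row, second-column element of `R1`.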

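An NE for a two-player matrix game is a strategy pair $(\pi_1^*, \pi_2^*)$ for the two players such that, for $i = 1, 2$,

$V_i(\pi_i^*, \pi_{-i}^*) \ge V_i(\pi_i, \pi_{-i}^*), \quad \forall \pi_i \in PD(A_i)$ (2)

where $-i$ denotes the player other than player $i$, and $PD(A_i)$ is the set of all probability distributions over player $i$'s action set $A_i$.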
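The NE condition can be tested numerically for a candidate strategy pair. Because each player's expected reward is linear in that player's own strategy, the best unilateral deviation is always attained at some pure strategy, so it is enough to compare against pure deviations. The sketch below relies on that observation; the names `expected_reward` and `is_nash`, the tolerance, and the payoffs are assumptions made for illustration.

```python
def expected_reward(pi1, R, pi2):
    """Compute pi1 * R * pi2^T for row strategy pi1 and column strategy pi2."""
    return sum(
        pi1[l] * R[l][f] * pi2[f]
        for l in range(len(pi1))
        for f in range(len(pi2))
    )


def is_nash(pi1, pi2, R1, R2, tol=1e-9):
    """Check that neither player gains by a unilateral deviation.

    Only pure deviations are tested, which suffices because the expected
    reward is linear in the deviating player's own strategy.
    """
    v1 = expected_reward(pi1, R1, pi2)
    v2 = expected_reward(pi1, R2, pi2)
    n1, n2 = len(pi1), len(pi2)
    for l in range(n1):  # row player deviates to pure action l
        pure = [1.0 if k == l else 0.0 for k in range(n1)]
        if expected_reward(pure, R1, pi2) > v1 + tol:
            return False
    for f in range(n2):  # column player deviates to pure action f
        pure = [1.0 if k == f else 0.0 for k in range(n2)]
        if expected_reward(pi1, R2, pure) > v2 + tol:
            return False
    return True


# Matching pennies (illustrative payoffs).
R1 = [[1, -1], [-1, 1]]
R2 = [[-1, 1], [1, -1]]

print(is_nash([0.5, 0.5], [0.5, 0.5], R1, R2))  # uniform mixing by both players
print(is_nash([1.0, 0.0], [1.0, 0.0], R1, R2))  # both players pick their first action
```

In the second call the column player can improve by switching columns, so the pair fails the check.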

Given that each player has two actions in the game, we can define a two-player two-action general-sum game as

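$R_1 = \begin{bmatrix} r_{11} & r_{12} \\ r_{21} & r_{22} \end{bmatrix}, \quad R_2 = \begin{bmatrix} c_{11} & c_{12} \\ c_{21} & c_{22} \end{bmatrix}$ (3)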

where $r_{lf}$ and $c_{lf}$ denote the reward to the row player (player 1) and the reward to the column player (player 2), respectively. The row player chooses action $l \in \{1, 2\}$ and the column player chooses action $f \in \{1, 2\}$. The pure strategies $l$ and $f$ are called a strict NE in pure strategies if

$r_{lf} > r_{-l,f}, \quad c_{lf} > c_{l,-f} \quad \text{for } l, f \in \{1, 2\}$ (4)

where $-l$ and $-f$ denote any row other than row $l$ and any column other than column $f$, respectively.
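The strict-NE test is a pair of strict inequalities per cell: the row player's reward must beat every other entry in the same column of its matrix, and the column player's reward must beat every other entry in the same row of its matrix. The sketch below enumerates all cells that satisfy both; the function name `strict_pure_nash` and the prisoner's-dilemma payoffs are illustrative assumptions, and actions are reported with 1-based indices to match the notation above.

```python
def strict_pure_nash(R1, R2):
    """Return all (l, f) pairs, 1-based, that are strict NE in pure strategies."""
    n_rows, n_cols = len(R1), len(R1[0])
    found = []
    for l in range(n_rows):
        for f in range(n_cols):
            # Row player: r_lf must exceed the reward in every other row of column f.
            row_ok = all(R1[l][f] > R1[k][f] for k in range(n_rows) if k != l)
            # Column player: c_lf must exceed the reward in every other column of row l.
            col_ok = all(R2[l][f] > R2[l][k] for k in range(n_cols) if k != f)
            if row_ok and col_ok:
                found.append((l + 1, f + 1))
    return found


# Prisoner's dilemma (illustrative payoffs): action 1 = cooperate, action 2 = defect.
R1 = [[-1, -3], [0, -2]]
R2 = [[-1, 0], [-3, -2]]
print(strict_pure_nash(R1, R2))

# Matching pennies (illustrative payoffs) has no NE in pure strategies.
print(strict_pure_nash([[1, -1], [-1, 1]], [[-1, 1], [1, -1]]))
```

The function is written for any matrix size, although the text only defines the two-action case.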
