NONCOOPERATIVE GAME THEORY
ECE594D — Fall 2007

Enrollment Code 54098, 1-5 units

Tue/Thu 2:00-3:50pm @ Phelps 1437

Course description

In optimization, one attempts to find values for parameters that minimize a suitably defined criterion (such as monetary cost, energy consumption, heat generated, etc.) However, in most engineering applications there is always some uncertainty as to how the selected parameters will affect the final objective. One can then pose the problem of how to make sure that the selection will lead to acceptable performance, even in the presence of some degree of uncertainty. This question is at the heart of most zero-sum games that appear in engineering applications. In fact, game theory provides the mathematical framework for robust design in engineering.

[ROCK][PAPER][SCISSORS]

Modern game theory was born in the 30’s, mostly propelled by the work of John von Neumann, further refined by Morgenstern, Kuhn, Nash, Shappley and others. Throughout most of the 40’s and 50’s, Economics was its main application, eventually leading to the 1994 Nobel prize in Economic Science awarded to John Nash, John C. Harsanyi, and Reinhard Selten for their contributions to Game Theory. It was not until the 70s that it started to have a significant impact on engineering and in the late 80’s it made a major splash in control theory and robust filtering.

The purpose of this course is to teach students to formulate problems as mathematical games and provide the basic tools to solve them. The course covers:

·         Static games, starting with two-player zero-sum games and eventually building up to n-player non-zero sum games. Saddle-points, Nash equilibria, and Stackelberg solutions will be covered.

·         Dynamic optimization (dynamic programming) for discrete and continuous time.

·         Dynamic games, both open and closed-loop policies. Saddle-points and Nash equilibria will be covered.

The intended audience includes (but is not restricted to) students in communications, controls, signal processing, and computer science. The class will be entirely project-oriented (no exams!) and the students are strongly encouraged to choose a project that is relevant to their own area of research.

Further information (including a detailed syllabus) is available on the web at

http://www.ece.ucsb.edu/~hespanha/ece594d/.

Prerequisites

ECE 210A Matrix Analysis and Computation: Graduate level-matrix theory with introduction to matrix analysis and computations: SVD's, pseudo-inverses, variational characterization of eigenvalues, perturbation theory (e.g., ECE 210A).

A review of basic probability theory is very much encouraged!

Instructor

João P. Hespanha (hespanha at ece.ucsb.edu), phone: (805) 893-7042, office: Engineering I, 5157.

Office hours:     Please email or phone in advance to schedule an appointment.

Textbook

The main textbooks are:

[1]   Tamer Başar and Geert Olsder. Dynamic Noncooperative Game Theory, 2nd ed. SIAM Classics in Applied Mathematics, 1999. ISBN 0-89871-429-X.

[2]   Drew Fudenberg and Jean Tirole. Game Theory. MIT Press, 1991. ISBN 0-262-06141-4

[3]   D. Bertsekas. Dynamic programming and optimal control, vol I, Athena Scientific. Athena Scientific, 1995. ISBN 1-886529-12-4.

The lectures follow closely the following (very preliminary) lecture notes:

http://www.ece.ucsb.edu/~hespanha/published/games.pdf

Projects

The following two types of projects are possible in this course:

1.       Solution of a research problem relevant to the student’s area of research

2.       Independent study of a topic not covered in class (e.g., reading a paper or book chapter). Possible topics include:

a.       N-person games in extensive form (Sec 3.5 of [1])

b.       Static infinite games (Ch. 4 of [1])

c.       Learning and evolution in repeated games (pp. 23-29 of [2])

d.       Markov games

e.       Computer network games (security, resource management)

f.        Pursuit-evasion (Ch. 8 of [1])

g.       Cooperative games

A one-page project proposal is due on Nov 1st

Detailed Syllabus

The following is a tentative schedule for the course. As revisions are needed, they will be posted on the course's web page. The rightmost column of the schedule contains the recommended reading for the topics covered on each class. Students are strongly encouraged to read these materials prior to the class.

Class

Content

References

#1, Set 27

No class

 

#2, Oct 2

Course overview

Syllabus

#3, Oct 4

Noncooperative games

1           Players, actions, and policies

2           Information structure

3           Optimality vs. equilibrium

4           Static vs. Dynamic games

Ch 1 of [1]

#4, Oct 9

Zero-sum games: Matrix form

  1. Matrix/normal/strategic form
  2. Security policies
  3. Saddle-point policies
  4. Mixed policies

Secs 2.1-2.2 of [1]

#5, Oct 11

  1. Minimax Theorem
  2. Computation of mixed policies: graphical method

Secs 2.2-2.3 of [1]

#6, Oct 16

No class

 

#7, Oct 18

  1. Computation of mixed policies: LP
  2. Dominating policies

Secs 2.3-2.4 of [1]

 

#8, Oct 23

Zero-sum games in Extensive form

1.       Extensive/tree form

2.       Saddle-point policies

Sec 2.4 of [1]

#9, Oct 25

3.       Feedback games

4.       Feedback saddle-point policies

5.       Mixed policies

 

#10, Oct 30

6.       Behavioral policies

Sec 2.5 of [1]

#11, Nov 1

Bimatrix games

1.       Nash equilibrium

2.       Mixed policies

3.       Existence of Nash equilibrium in mixed policies

4.       Computation of mixed policies: QP

Secs 3.1-3.2, 3.4 of [1]

#12, Nov 6

No class

#13, Nov 8

N-person games: Matrix form

1.       Nash equilibrium

2.       Mixed policies

Existence of Nash equilibrium in mixed policies

 

Secs 3.3-3.5 of [1]

#14, Nov 13

Static infinite games

1.       e-Nash equilibrium

2.       Pure vs. mixed policies

3.       Semi-infinite bimatrix games

4.       Continuous kernel games

Secs 4.1-4.4

#15, Nov 15

Infinite dynamic games

1.       Discrete-time dynamic games

2.       Continuous-time dynamic games

Secs 5.1-5.3

#16, Nov 20

Dynamic (one player) optimization (discrete- and continuous-time)

1.       Dynamic programming

Secs 5.5-5.6

Nov 22

Thanksgiving (no class)

 

#17, Nov 27

2.       Minimum principle

Dynamic zero-sum games (discrete- and continuous-time)

1.       Necessary conditions for open-loop equilibrium

Secs  5.5-5.6, 6.1-6.2, 6.5

#18, Nov 29

2.       Sufficient conditions for open-loop equilibrium

3.       Sufficient conditions for feedback equilibrium

4.       Linear quadratic games (H-infinity control)

Secs 6.1-6.2, 6.5

#19, Dec 4

Project presentations

 

#20, Dec 6

Project presentations

 


Homework

Number

Posted on

Due date

Exercises

1

Oct 13

Oct 25

Download problems from here

Solutions are available here. Please print, fill in, and turn in the last page with your homework.

Oct 31

Nov 13

Download problems from here.

Solutions are available here. Please print, fill in, and turn in the last page with your homework.

3

Nov 17

Nov 27

Download problems from here.

Solutions are available here. Please print, fill in, and turn in the last page with your homework.