Exercise 21.9

Extend the standard game-playing environment (from the chapter on game playing) to incorporate a reward signal. Put two reinforcement learning agents into the environment (they may, of course, share the agent program) and have them play against each other. Apply the generalized TD update rule to update the evaluation function. You might wish to start with a simple linear weighted evaluation function and a simple game, such as tic-tac-toe.
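One possible starting point is sketched below in Python. It is a minimal illustration under several assumptions, not a reference solution: the generalized TD rule is taken in the form θ_i ← θ_i + α [R + γ U_θ(s') − U_θ(s)] ∂U_θ(s)/∂θ_i, the reward is +1 when X wins, −1 when O wins, and 0 otherwise, and both agents share one weight vector (X picks moves that maximize the evaluation, O picks moves that minimize it). The feature set and the names `features`, `outcome`, `choose_move`, and `self_play` are hypothetical choices made for this sketch.

```python
import random

# The eight winning lines of a 3x3 board indexed 0..8.
LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8), (0, 3, 6),
         (1, 4, 7), (2, 5, 8), (0, 4, 8), (2, 4, 6)]

def winner(board):
    """Return +1 if X (encoded 1) has a line, -1 if O (encoded -1) does, else 0."""
    for a, b, c in LINES:
        total = board[a] + board[b] + board[c]
        if total == 3:
            return 1
        if total == -3:
            return -1
    return 0

def outcome(board):
    """Return (done, reward), with the reward seen from X's point of view."""
    win = winner(board)
    return (win != 0 or 0 not in board), float(win)

def features(board):
    """Assumed features: bias, then counts of lines holding exactly 1 or 2
    marks of one player and none of the other (X first, then O)."""
    f = [1.0, 0.0, 0.0, 0.0, 0.0]
    for line in LINES:
        cells = [board[i] for i in line]
        x, o = cells.count(1), cells.count(-1)
        if o == 0 and x in (1, 2):
            f[x] += 1.0
        if x == 0 and o in (1, 2):
            f[2 + o] += 1.0
    return f

def value(w, board):
    """Linear weighted evaluation U(s) = w . f(s)."""
    return sum(wi * fi for wi, fi in zip(w, features(board)))

def choose_move(w, board, player, epsilon, rng):
    """Epsilon-greedy one-ply lookahead; O negates the score, so it minimizes."""
    moves = [i for i in range(9) if board[i] == 0]
    if rng.random() < epsilon:
        return rng.choice(moves)
    def score(move):
        nxt = board[:]
        nxt[move] = player
        done, reward = outcome(nxt)
        return player * (reward if done else value(w, nxt))
    return max(moves, key=score)

def self_play(episodes=2000, alpha=0.01, gamma=1.0, epsilon=0.1, seed=0):
    """Train shared weights by self-play, applying a TD update after every move."""
    rng = random.Random(seed)
    w = [0.0] * 5
    for _ in range(episodes):
        board, player, done = [0] * 9, 1, False
        while not done:
            move = choose_move(w, board, player, epsilon, rng)
            nxt = board[:]
            nxt[move] = player
            done, reward = outcome(nxt)
            # Terminal states are assigned value 0, so the target is the reward alone.
            target = reward if done else gamma * value(w, nxt)
            delta = target - value(w, board)
            # For a linear U, the gradient with respect to w is the feature vector.
            grad = features(board)
            for i in range(len(w)):
                w[i] += alpha * delta * grad[i]
            board, player = nxt, -player
    return w

if __name__ == "__main__":
    print(self_play())
```

Because the evaluation is linear, the gradient term in the update is just the feature vector, which keeps the learning step to a few lines. The step size, exploration rate, and episode count are arbitrary defaults for illustration; a richer feature set or a decaying step size may be needed before the learned weights produce strong play.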

Artificial Intelligence: A Modern Approach

Stuart J. Russell and Peter Norvig