Exercise 21.4

Table of Contents

Part Ⅰ Artificial Intelligence
1. 1. Introduction
2. 2. Intelligent Agent
Part Ⅱ Problem-solving
Part Ⅲ Knowledge, reasoning, and planning
Part Ⅳ Uncertain knowledge and reasoning
Part Ⅴ Learning
Part Ⅵ Communicating, perceiving, and acting
Part Ⅶ Conclusions
1. 26. Philosophical Foundations
2. Future Exercises

The direct utility estimation method in Section passive-rl-section uses distinguished terminal states to indicate the end of a trial. How could it be modified for environments with discounted rewards and no terminal states?

Answer Improve This Solution

View Answer

Request Answer

Aritificial Intelligence: A Modern Approach

Stuart J. Russell and Peter Norvig