Exercise 17.3

Table of Contents

Part Ⅰ Artificial Intelligence
1. 1. Introduction
2. 2. Intelligent Agent
Part Ⅱ Problem-solving
Part Ⅲ Knowledge, reasoning, and planning
Part Ⅳ Uncertain knowledge and reasoning
Part Ⅴ Learning
Part Ⅵ Communicating, perceiving, and acting
Part Ⅶ Conclusions
1. 26. Philosophical Foundations
2. Future Exercises

Select a specific member of the set of policies that are optimal for $R(s)>0$ as shown in Figure sequential-decision-policies-figure(b), and calculate the fraction of time the agent spends in each state, in the limit, if the policy is executed forever. (Hint: Construct the state-to-state transition probability matrix corresponding to the policy and see Exercise markov-convergence-exercise.)

Answer Improve This Solution

View Answer

Request Answer

Aritificial Intelligence: A Modern Approach

Stuart J. Russell and Peter Norvig