Abstract
Conservative Policy Iteration is a general algorithm for approximate
dynamic programming which comes with 3 important guarantees:
2) The algorithm terminates in a finite number of steps. 3) Upon termination, it returns a policy which is near optimal. |
Charles Rosenberg Last modified: Thu May 9 23:39:10 EDT 2002