Robust Modified Policy Iteration.

Author(s): Kaufman, David L.; Schaefer, Andrew J.
Source: INFORMS Journal on Computing. Summer 2013, Vol. 25, Issue 3, pp. 396-410. 15 p. 6 Charts, 3 Graphs.

Additional Information
- Subject Terms:
  ROBUST control; ITERATIVE methods (Mathematics); DYNAMIC programming; PROBABILITY theory; MARKOV processes; ALGORITHMS; PROBLEM solving
- Abstract:
  Robust dynamic programming (robust DP) mitigates the effects of ambiguity in transition probabilities on the solutions of Markov decision problems. We consider the computation of robust DP solutions for discrete-stage, infinite-horizon, discounted problems with finite state and action spaces. We present robust modified policy iteration (RMPI) and demonstrate its convergence. RMPI encompasses both of the previously known algorithms, robust value iteration and robust policy iteration. In addition to proposing exact RMPI, in which the "inner problem" is solved precisely, we propose inexact RMPI, in which the inner problem is solved to within a specified tolerance. We also introduce new stopping criteria based on the span seminorm. Finally, we demonstrate through some numerical studies that RMPI can significantly reduce computation time. [ABSTRACT FROM AUTHOR]
- Copyright:
  Copyright of INFORMS Journal on Computing is the property of INFORMS: Institute for Operations Research and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
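
The abstract describes RMPI only in words, so the following is a minimal sketch of the general idea and not the authors' implementation. It assumes, for illustration only, that the ambiguity in each transition distribution is a finite list of candidate distributions, so the "inner problem" reduces to an exact minimum over that list. The names `worst_case` and `robust_mpi`, the parameter `m` (number of partial evaluation sweeps), and the toy data are all hypothetical. The span-based stopping rule is the textbook bound for discounted problems, applied here as an assumption; the criteria in the paper may differ. Starting from a zero value function with nonnegative rewards is likewise an assumption of this sketch.

```python
from typing import List, Sequence, Tuple

Dist = Sequence[float]  # one candidate distribution over next states


def worst_case(candidates: Sequence[Dist], v: Sequence[float]) -> float:
    """Inner problem: smallest expected value of v over the candidate set."""
    return min(sum(p * x for p, x in zip(dist, v)) for dist in candidates)


def robust_mpi(
    P: Sequence[Sequence[Sequence[Dist]]],  # P[s][a] = candidate distributions
    r: Sequence[Sequence[float]],           # r[s][a] = immediate reward (assumed >= 0)
    gamma: float,                           # discount factor, 0 < gamma < 1
    m: int,                                 # extra evaluation sweeps per iteration
    eps: float = 1e-9,
    max_iter: int = 100_000,
) -> Tuple[List[int], List[float]]:
    n = len(r)
    v = [0.0] * n
    for _ in range(max_iter):
        # Improvement step: greedy policy under the worst-case backup.
        q = [
            [r[s][a] + gamma * worst_case(P[s][a], v) for a in range(len(r[s]))]
            for s in range(n)
        ]
        policy = [max(range(len(q[s])), key=q[s].__getitem__) for s in range(n)]
        u = [q[s][policy[s]] for s in range(n)]

        # Span-seminorm stopping test on the one-step change u - v.
        diff = [u[s] - v[s] for s in range(n)]
        lo, hi = min(diff), max(diff)
        if hi - lo < eps * (1 - gamma) / gamma:
            # Shift by the midpoint of the lower and upper bounds on the value.
            shift = gamma / (1 - gamma) * (lo + hi) / 2
            return policy, [x + shift for x in u]

        # Partial evaluation: m more worst-case backups with the policy fixed.
        for _ in range(m):
            u = [
                r[s][policy[s]] + gamma * worst_case(P[s][policy[s]], u)
                for s in range(n)
            ]
        v = u
    raise RuntimeError("robust_mpi did not converge within max_iter iterations")


# Toy two-state, two-action problem (made-up numbers, for illustration only).
P_toy = [
    [[[1.0, 0.0]], [[0.0, 1.0]]],
    [[[0.0, 1.0], [0.5, 0.5]], [[1.0, 0.0]]],
]
r_toy = [[1.0, 0.0], [2.0, 0.0]]
policy_toy, value_toy = robust_mpi(P_toy, r_toy, gamma=0.5, m=5)
```

In this sketch, `m=0` performs no extra evaluation sweeps and so behaves like robust value iteration, while a large `m` approaches robust policy iteration, which mirrors the abstract's statement that RMPI encompasses both.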
