Header logo is

Monte Carlo methods for exact & efficient solution of the generalized optimality equations

2014

Conference Paper

ei


Previous work has shown that classical sequential decision making rules, including expectimax and minimax, are limit cases of a more general class of bounded rational planning problems that trade off the value and the complexity of the solution, as measured by its information divergence from a given reference. This allows modeling a range of novel planning problems having varying degrees of control due to resource constraints, risk-sensitivity, trust and model uncertainty. However, so far it has been unclear in what sense information constraints relate to the complexity of planning. In this paper, we introduce Monte Carlo methods to solve the generalized optimality equations in an efficient \& exact way when the inverse temperatures in a generalized decision tree are of the same sign. These methods highlight a fundamental relation between inverse temperatures and the number of Monte Carlo proposals. In particular, it is seen that the number of proposals is essentially independent of the size of the decision tree.

Author(s): Ortega, PA and Braun, DA and Tishby, N
Pages: 4322-4327
Year: 2014
Month: June
Publisher: IEEE

Department(s): Empirical Inference
Bibtex Type: Conference Paper (conference)

DOI: 10.1109/ICRA.2014.6907488
Event Name: IEEE International Conference on Robotics and Automation (ICRA 2014)
Event Place: Hong Kong, China

Address: Piscataway, NJ, USA
URL: http://www.kyb.tuebingen.mpg.defileadmin/user_upload/files/publications/2014/ICRA-2014-Ortega.pdf

BibTex

@conference{OrtegaBT2014,
  title = {Monte Carlo methods for exact \& efficient solution of the generalized optimality equations},
  author = {Ortega, PA and Braun, DA and Tishby, N},
  pages = {4322-4327},
  publisher = {IEEE},
  address = {Piscataway, NJ, USA},
  month = jun,
  year = {2014},
  doi = {10.1109/ICRA.2014.6907488},
  url = {http://www.kyb.tuebingen.mpg.defileadmin/user_upload/files/publications/2014/ICRA-2014-Ortega.pdf},
  month_numeric = {6}
}