Recursive robust estimation and control without commitment. Hansen, L. P. & Sargent, T. J. Journal of Economic Theory, 136(1):1–27, September, 2007.
Recursive robust estimation and control without commitment [link]Paper  doi  abstract   bibtex   
In a Markov decision problem with hidden state variables, a posterior distribution serves as a state variable and Bayes’ law under an approximating model gives its law of motion. A decision maker expresses fear that his model is misspecified by surrounding it with a set of alternatives that are nearby when measured by their expected log likelihood ratios (entropies). Martingales represent alternative models. A decision maker constructs a sequence of robust decision rules by pretending that a sequence of minimizing players choose increments to martingales and distortions to the prior over the hidden state. A risk sensitivity operator induces robustness to perturbations of the approximating model conditioned on the hidden state. Another risk sensitivity operator induces robustness to the prior distribution over the hidden state. We use these operators to extend the approach of Hansen and Sargent [Discounted linear exponential quadratic Gaussian control, IEEE Trans. Automat. Control 40(5) (1995) 968–971] to problems that contain hidden states.
@article{hansen_recursive_2007,
	title = {Recursive robust estimation and control without commitment},
	volume = {136},
	issn = {00220531},
	url = {http://linkinghub.elsevier.com/retrieve/pii/S0022053106001736},
	doi = {10.1016/j.jet.2006.06.010},
	abstract = {In a Markov decision problem with hidden state variables, a posterior distribution serves as a state variable and Bayes’ law under an approximating model gives its law of motion. A decision maker expresses fear that his model is misspecified by surrounding it with a set of alternatives that are nearby when measured by their expected log likelihood ratios (entropies). Martingales represent alternative models. A decision maker constructs a sequence of robust decision rules by pretending that a sequence of minimizing players choose increments to martingales and distortions to the prior over the hidden state. A risk sensitivity operator induces robustness to perturbations of the approximating model conditioned on the hidden state. Another risk sensitivity operator induces robustness to the prior distribution over the hidden state. We use these operators to extend the approach of Hansen and Sargent [Discounted linear exponential quadratic Gaussian control, IEEE Trans. Automat. Control 40(5) (1995) 968–971] to problems that contain hidden states.},
	language = {en},
	number = {1},
	urldate = {2017-09-10},
	journal = {Journal of Economic Theory},
	author = {Hansen, Lars Peter and Sargent, Thomas J.},
	month = sep,
	year = {2007},
	keywords = {DR, Untagged},
	pages = {1--27},
}

Downloads: 0