Anthony R. Cassandra, Leslie P. Kaelbling, and Michael L. Littman (1994),
Acting Optimally in Partially Observable Stochastic Domains.
In Proceedings of the Twelfth National Conference on Artificial
Intelligence, pp. 1023-1028.
Lonnie Chrisman (1992), Reinforcement Learning with Perceptual Aliasing: The
Perceptual Distinctions Approach, Proceedings of the AAAI Conference on
Artificial Intelligence, 10, AAAI-92.
Michael L. Littman (2009), A tutorial on partially observable Markov decision processes,
Journal of Mathematical Psychology, Volume 53, Issue 3, June 2009, Pages 119-125.
doi:10.1016/j.jmp.2009.01.005