optimal_action

Determines the optimal action for a policy (solved POMDP) for a given belief
at a given epoch.

Provides the infrastructure to define and analyze the solutions of Partially Observable Markov Decision Process (POMDP) models. Interfaces for various exact and approximate solution algorithms are available including value iteration, point-based value iteration and SARSOP. Smallwood and Sondik (1973) <doi:10.1287/opre.21.5.1071>.

Michael Hahsler

pomdp

Infrastructure for Partially Observable Markov Decision
Processes (POMDP)

Hossein Kamalzadeh

optimal_action function

<dl><dt>model</dt>
<dd>a solved POMDP.</dd>
<dt>belief</dt>
<dd>The belief (probability distribution over the states) as a
vector or a matrix with multiple belief states as rows. If <code>NULL</code>, then the initial belief of the
model is used.</dd>
<dt>epoch</dt>
<dd>what epoch of the policy should be used. Use 1 for converged policies.</dd></dl>

Arguments

Author

Optimal action for a belief — optimal_action

<dl>

<dt>model</dt>
<dd>a solved POMDP.</dd>


<dt>belief</dt>
<dd>The belief (probability distribution over the states) as a
vector or a matrix with multiple belief states as rows. If <code>NULL</code>, then the initial belief of the
model is used.</dd>


<dt>epoch</dt>
<dd>what epoch of the policy should be used. Use 1 for converged policies.</dd>

</dl>

optimal_action: Optimal action for a belief

Description

Usage

Value

Arguments

Author

See Also

Examples