Cliff_walking

cliff_walking

<p>The cliff walking gridworld MDP example from Chapter 6 of the textbook
"Reinforcement Learning: An Introduction."</p>

datasets

Provides the infrastructure to define and analyze the solutions of Partially Observable Markov Decision Process (POMDP) models. Interfaces for various exact and approximate solution algorithms are available including value iteration, point-based value iteration and SARSOP. Smallwood and Sondik (1973) <doi:10.1287/opre.21.5.1071>.

Michael Hahsler

pomdp

Infrastructure for Partially Observable Markov Decision
Processes (POMDP)

Hossein Kamalzadeh

Cliff_walking function

Format

Cliff Walking Gridworld MDP — Cliff_walking

Cliff Walking Gridworld MDP

Cliff_walking: Cliff Walking Gridworld MDP

Description

Arguments

Format

Details

References

See Also

Examples