makeAgent

[<code>character(1)</code> | Policy] A policy.
If you pass a string the policy will be created via <a rd-options="" href="/link/makePolicy?package=reinforcelearn&version=0.2.1" data-mini-rdoc="reinforcelearn::makePolicy">makePolicy</a>.

policy

[<code>character(1)</code> | ValueFunction] A value function representation.
If you pass a string the value function will be created via <a rd-options="" href="/link/makeValueFunction?package=reinforcelearn&version=0.2.1" data-mini-rdoc="reinforcelearn::makeValueFunction">makeValueFunction</a>.

val.fun

[<code>character(1)</code> | Algorithm] An algorithm.
If you pass a string the algorithm will be created via <a rd-options="" href="/link/makeAlgorithm?package=reinforcelearn&version=0.2.1" data-mini-rdoc="reinforcelearn::makeAlgorithm">makeAlgorithm</a>.

algorithm

[<code>function</code>] A function which preprocesses the state so that the agent can learn on this.

preprocess

[<code>ReplayMemory</code>] Replay memory for experience replay created by <a rd-options="" href="/link/makeReplayMemory?package=reinforcelearn&version=0.2.1" data-mini-rdoc="reinforcelearn::makeReplayMemory">makeReplayMemory</a>.

replay.memory

[<code>list</code>] Arguments passed on to <code>args</code> in <a rd-options="" href="/link/makePolicy?package=reinforcelearn&version=0.2.1" data-mini-rdoc="reinforcelearn::makePolicy">makePolicy</a>.

policy.args

[<code>list</code>] Arguments passed on to <code>args</code> in <a rd-options="" href="/link/makeValueFunction?package=reinforcelearn&version=0.2.1" data-mini-rdoc="reinforcelearn::makeValueFunction">makeValueFunction</a>.

val.fun.args

[<code>list</code>] Arguments passed on to <code>args</code> in <a rd-options="" href="/link/makeAlgorithm?package=reinforcelearn&version=0.2.1" data-mini-rdoc="reinforcelearn::makeAlgorithm">makeAlgorithm</a>.

algorithm.args

An agent consists of a policy and (optional) a value function representation
and (optional) a learning algorithm.

Implements reinforcement learning environments and algorithms as described in Sutton & Barto (1998, ISBN:0262193981).
The Q-Learning algorithm can be used with function approximation,
eligibility traces (Singh & Sutton (1996) <doi:10.1007/BF00114726>)
and experience replay (Mnih et al. (2013) <arXiv:1312.5602>).

Markus Dumke

reinforcelearn

Reinforcement Learning

makeAgent function

[<code>character(1)</code> | Policy] A policy.
If you pass a string the policy will be created via <a rd-options='' href='makePolicy'>makePolicy</a>.

[<code>character(1)</code> | ValueFunction] A value function representation.
If you pass a string the value function will be created via <a rd-options='' href='makeValueFunction'>makeValueFunction</a>.

[<code>character(1)</code> | Algorithm] An algorithm.
If you pass a string the algorithm will be created via <a rd-options='' href='makeAlgorithm'>makeAlgorithm</a>.

[<code>ReplayMemory</code>] Replay memory for experience replay created by <a rd-options='' href='makeReplayMemory'>makeReplayMemory</a>.

[<code>list</code>] Arguments passed on to <code>args</code> in <a rd-options='' href='makePolicy'>makePolicy</a>.

[<code>list</code>] Arguments passed on to <code>args</code> in <a rd-options='' href='makeValueFunction'>makeValueFunction</a>.

[<code>list</code>] Arguments passed on to <code>args</code> in <a rd-options='' href='makeAlgorithm'>makeAlgorithm</a>.

makeAgent: Create Agent.

Description

Usage

Arguments

Examples