ContextualEpochGreedyPolicy

Policy: A Time and Space Efficient Algorithm for Contextual Linear Bandits

Facilitates the simulation and evaluation of context-free
and contextual multi-Armed Bandit policies or algorithms to ease the
implementation, evaluation, and dissemination of both existing and
new bandit algorithms and policies.

Robin van Emden

contextual

Simulation and Analysis of Contextual Multi-Armed Bandit
Policies

Maurits Kaptein

ContextualEpochGreedyPolicy function

<pre>
 policy &lt;- ContextualEpochGreedyPolicy$new(sZl = 10)
</pre>

ContextualEpochGreedyPolicy: Policy: A Time and Space Efficient Algorithm for Contextual Linear Bandits

Description

Arguments

Usage

See Also