powered by
Policy: A Time and Space Efficient Algorithm for Contextual Linear Bandits
policy <- ContextualEpochGreedyPolicy$new(sZl = 10)
Core contextual classes: Bandit, Policy, Simulator, Agent, History, Plot
Bandit
Policy
Simulator
Agent
History
Plot
Bandit subclass examples: BasicBernoulliBandit, ContextualLogitBandit, OfflineReplayEvaluatorBandit
BasicBernoulliBandit
ContextualLogitBandit
OfflineReplayEvaluatorBandit
Policy subclass examples: EpsilonGreedyPolicy, ContextualLinTSPolicy
EpsilonGreedyPolicy
ContextualLinTSPolicy