powered by
Epsilon Greedy Policy
[numeric(1) in [0, 1]] Ratio of random exploration in epsilon-greedy action selection.
numeric(1) in [0, 1]
makePolicy("epsilon.greedy", epsilon = 0.1) makePolicy("greedy")
makePolicy("epsilon.greedy", epsilon = 0.1)
makePolicy("greedy")
# NOT RUN { policy = makePolicy("epsilon.greedy", epsilon = 0.1) # }
Run the code above in your browser using DataLab