Class EpsilonGreedyExploration
Inheritance
Implements
Inherited Members
Namespace: Mars.Components.Services.Explorations
Assembly: Mars.Components.dll
Syntax
[Serializable]
public class EpsilonGreedyExploration : IExplorationPolicy
Remarks
The class implements epsilon greedy exploration policy. According to the policy, the best action is chosen with probability 1-epsilon. Otherwise, with probability epsilon, any other action, except the best one, is chosen randomly.
According to the policy, the epsilon value is known also as exploration rate.
Constructors
| Improve this Doc View SourceEpsilonGreedyExploration(Double)
Declaration
public EpsilonGreedyExploration(double epsilon)
Parameters
Type | Name | Description |
---|---|---|
System.Double | epsilon | Epsilon value (exploration rate). |
Remarks
The class implements epsilon greedy exploration policy. According to the policy, the best action is chosen with probability 1-epsilon. Otherwise, with probability epsilon, any other action, except the best one, is chosen randomly.
According to the policy, the epsilon value is known also as exploration rate.
See Also
Properties
| Improve this Doc View SourceEpsilon
Declaration
public double Epsilon { get; set; }
Property Value
Type | Description |
---|---|
System.Double |
Remarks
The value determines the amount of exploration driven by the policy. If the value is high, then the policy drives more to exploration - choosing random action, which excludes the best one. If the value is low, then the policy is more greedy - choosing the beat so far action.
See Also
Methods
| Improve this Doc View SourceChooseAction(Double[])
Declaration
public int ChooseAction(double[] actionEstimates)
Parameters
Type | Name | Description |
---|---|---|
System.Double[] | actionEstimates | Action estimates. |
Returns
Type | Description |
---|---|
System.Int32 | Returns selected action. |