Class RouletteWheelExploration
Roulette wheel exploration policy.
Inheritance
System.Object
RouletteWheelExploration
Implements
Inherited Members
System.Object.Equals(System.Object)
System.Object.Equals(System.Object, System.Object)
System.Object.GetHashCode()
System.Object.GetType()
System.Object.MemberwiseClone()
System.Object.ReferenceEquals(System.Object, System.Object)
System.Object.ToString()
Namespace: Mars.Components.Services.Explorations
Assembly: Mars.Components.dll
Syntax
[Serializable]
public class RouletteWheelExploration : IExplorationPolicy
Remarks
The class implements roulette wheel exploration policy. According to the policy, action a at state s is selected with the next probability:
Q( s, a )
p( s, a ) = ------------------
SUM( Q( s, b ) )
b
where Q(s, a) is action's a estimation (usefulness) at state s.
note
The exploration policy may be applied only in cases, when action estimates (usefulness) are represented with positive value greater then 0.
Methods
| Improve this Doc View SourceChooseAction(Double[])
Choose an action.
Declaration
public int ChooseAction(double[] actionEstimates)
Parameters
Type | Name | Description |
---|---|---|
System.Double[] | actionEstimates | Action estimates. |
Returns
Type | Description |
---|---|
System.Int32 | Returns selected action. |
Remarks
The method chooses an action depending on the provided estimates. The
estimates can be any sort of estimate, which values usefulness of the action
(expected summary reward, discounted reward, etc).