| computePolicy | Computes the reinforcement learning policy |
| epsilonGreedyActionSelection | Performs \varepsilon-greedy action selection |
| experienceReplay | Performs experience replay |
| gridworldEnvironment | Defines an environment for a gridworld example |
| lookupActionSelection | Converts a name into an action selection function |
| lookupLearningRule | Loads reinforcement learning algorithm |
| policy | Computes the reinforcement learning policy |
| randomActionSelection | Performs random action selection |
| ReinforcementLearning | Performs reinforcement learning |
| replayExperience | Performs experience replay |
| rl | Performs reinforcement learning |
| sampleExperience | Sample state transitions from an environment function |
| sampleGridSequence | Sample grid sequence |
| selectEpsilonGreedyAction | Performs \varepsilon-greedy action selection |
| selectRandomAction | Performs random action selection |
| state | Creates a state representation for arbitrary objects |
| tictactoe | Game states of 100,000 randomly sampled Tic-Tac-Toe games. |