Reinforcement learning

Summary