Published on: Mt

late 1980s

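Artificial Intelligence (AI): Reinforcement Learning (RL; Sutton & Barto) is developed. An agent interacts with its environment and learns from its actions and their consequences, which it receives as rewards. RL is usually treated as a learning paradigm of its own rather than as semi-supervised learning, because the agent gets only an evaluative reward signal instead of labeled examples. The technique is related to Dynamic Programming (Bellman, 1952) and models interaction with the environment as a Markov decision process, but unlike Dynamic Programming, RL does not require an a priori model of that environment.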
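
The agent/environment loop described above can be illustrated with a minimal sketch. It uses tabular Q-learning (Watkins, 1989), one well-known model-free RL algorithm, chosen here purely for illustration. The five-state corridor environment and the names `step`, `train`, and `greedy_policy` are hypothetical and do not come from the original text. The learner never inspects the environment's transition rules; it only observes the next state and the reward.

```python
import random

# Hypothetical toy environment: a corridor of 5 states (0..4).
# State 4 is the goal and ends the episode.
N_STATES = 5
ACTIONS = (-1, +1)  # index 0 = move left, index 1 = move right


def step(state, action):
    """Environment dynamics. The learner only sees the returned values."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, or when both actions tie.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, ACTIONS[a])
            # Bellman-style target: reward plus discounted best next value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best action (as -1 or +1) for each non-terminal state."""
    return [ACTIONS[0 if row[0] > row[1] else 1] for row in q[:-1]]
```

With these settings, `greedy_policy(train())` selects `+1` (move right, toward the goal) in every non-terminal state. The update target is the link to Dynamic Programming: it is a sampled form of the Bellman optimality backup, applied to observed transitions instead of a known model.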