#rl
This blog post summarizes the papers Settling the Reward Hypothesis1 and Utility Theory for Sequential Decision Making 2.
The reward hypothesis is at the core of Reinforcement Learning (RL) in that …