I make the case why people iteratively training any model should learn some core concerns of reinforcement learning.