Right, but we do have a reward function that says pleasurable/painful and novel/...

fallingfrog on March 14, 2016 | parent | context | favorite | on: Yann LeCun's comment on AlphaGo and true AI

Right, but we do have a reward function that says pleasurable/painful and novel/boring and probably other stuff too. So that can be viewed as a labeling on the data. Earlier data can be associated with the reward labeling through induction; that's how a recurrent neural net works. Doubtless that's an oversimplification though.