← Back to Simulations

⚡ Reward Prediction Error

Dopamine neurons signal reward prediction error (RPE): the difference between received and expected reward. This TD learning signal drives reinforcement learning throughout the brain.

Generation: 0
Expected Value: 0.00
RPE (δ): 0.00
DA Firing: baseline
TD Learning:
• δ = r + γV(s') - V(s)
• δ > 0: Better than expected
• δ < 0: Worse than expected
• δ = 0: As expected