Robonaissance

The RL Spiral, Part 1: The Reward Trap

You trained ChatGPT to lie to you. You did not mean to. Neither did the engineers. Here is how it happened, and why your brain did it first.

Hugo
Mar 09, 2026
∙ Paid

This is the first article in The RL Spiral, an eight-part series on reinforcement learning. The title is literal. RL and neuroscience have not developed in parallel. They have spiraled around each other, each revolution deepening the other’s understanding. That spiral started over a century ago. We are still inside it.

© 2026 Robonaissance