Ben Sheldon's personal blog and commonplace book.

LLMs and reinforcement learning | Structure and Interpretation of Computer Programmers

https://www.sicpers.info/2025/10/llms-and-reinforcement-learning/

October 13, 2025

To be clear, it isn’t the large language model that engages in reinforcement learning, it’s the person who’s applying the LLM to their task. That’s all that prompt engineering is.