LLMs and reinforcement learning | Structure and Interpretation of Computer Programmers
https://www.sicpers.info/2025/10/llms-and-reinforcement-learning/
To be clear, it isn’t the large language model that engages in reinforcement learning, it’s the person who’s applying the LLM to their task. That’s all that prompt engineering is.