Teaching a Language Model Arithmetic with Reinforcement Learning

My experience training a model on the Countdown Numbers Game, and watching it learn to cheat.