Teaching a Language Model Arithmetic with Reinforcement Learning
My experience training a model on the Countdown Numbers Game — and observing it learn to cheat.
My experience training a model on the Countdown Numbers Game — and observing it learn to cheat.