After that, Composer 1.5 scaled reinforcement learning by over 20x. Composer 2 then added continued pretraining, reaching ...