SemiAnalysis: Scaling Reinforcement Learning: Environments, Reward Hacking, Agents, Scaling Data.