Generative Verifiers: Reward Modeling as Next-Token Prediction (Paper • arXiv:2408.15240 • Published 23 days ago)