Generative Verifiers: Reward Modeling as Next-Token Prediction Paper โข 2408.15240 โข Published 23 days ago โข 12 โข 2