CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing Paper • 2305.11738 • Published May 19, 2023 • 6
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning Paper • 2402.14809 • Published Feb 22 • 2
DRLC: Reinforcement Learning with Dense Rewards from LLM Critic Paper • 2401.07382 • Published Jan 14 • 2