BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks 13 days ago • 30
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published 9 days ago • 40 • 8