We are excited to release JudgeBench, a challenging benchmark for evaluating LLM-based judges. Check out our leaderboard and code.