RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models Paper • 2510.10390 • Published Oct 12, 2025