Toward Socially Aware Vision-Language Models: Evaluating Cultural Competence Through Multimodal Story Generation Paper • 2508.16762 • Published Aug 22
mmJEE-Eval: A Bilingual Multimodal Benchmark for Evaluating Scientific Reasoning in Vision-Language Models Paper • 2511.09339 • Published 24 days ago