Free-form datasets, human annotations, and sample-level model outputs for "Answer Matching Outperforms Multiple Choice for Language Model Evaluation"
Nikhil Chandak
nikhilchandak
Recent Activity
updated a dataset about 24 hours ago: nikhilchandak/OpenForesight
published a dataset 4 days ago: nikhilchandak/OpenForesight
liked a dataset 3 months ago: brendel-group/MATH-Beyond