lmarena-ai's Collections
SearchArena
Arena-Hard-Auto
Prompt-to-Leaderboard
Arena-Hard-Auto
Updated Apr 24
An automatic evaluation tool for LLMs.
Arena Hard Viewer ⚡ • Running • 7
Browse and view model judgments in benchmarks
lmarena-ai/arena-hard-auto • Updated May 1 • 756 • 6