Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 4 days ago • 110
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 137
INTELLECT-3 Collection INTELLECT-3: A 100B+ MoE trained with large-scale RL • 4 items • Updated 7 days ago • 11
AICC: Parse HTML Finer, Make Models Better -- A 7.3T AI-Ready Corpus Built by a Model-Based HTML Parser Paper • 2511.16397 • Published 16 days ago • 7
The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification Paper • 2511.15622 • Published 17 days ago • 1
view article Article Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks +2 15 days ago • 19
BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer Paper • 2511.15090 • Published 17 days ago • 1
CASTELLA: Long Audio Dataset with Captions and Temporal Boundaries Paper • 2511.15131 • Published 17 days ago • 1
view article Article We’re open-sourcing our text-to-image model and the process behind it 24 days ago • 73
E-MM1 Collection Multimodal embedding model, supporting datasets, and a paper describing the process going into building both the datasets and the models 🤗 • 6 items • Updated 15 days ago • 10