Refresh ultravox-v0_5-deepseek-r1-llama-3_1-8b result with new quant 1e44766 steampunque commited on 16 days ago
Add refreshed ultravox llama3.1 8B results, add llama3.1 8B Q4_K_H result. 664f945 steampunque commited on 18 days ago
Add Ministral 3 3B Instruct 2512 Q4_K_H and Q6_K_H vision results c7bef93 steampunque commited on 29 days ago
Add MiniCPM-V-4_5 Q4_K_H vision result, rerun Q6_K_H vision result. Rerun LFM2-VL-1.6B vision result. 5733735 steampunque commited on Dec 2, 2025
Add Q4_K_H and Q6_K_H quant results for Qwen 2.5 Omni 3B and VL 3B, 1094d35 steampunque commited on Nov 28, 2025
Add Q4_K_H quant results for Qwen 2.5 VL 7B and Qwen 2.5 Omni 7B e5824d1 steampunque commited on Nov 23, 2025
Refresh CRUXEVALFIM bench, rerun for Qwen 2.5 Coder 7B Instruct Q4_K_H/Q6_K_H 593811c steampunque commited on Nov 13, 2025
Add Qwen 2.5 Coder 7B Instruct Q4_K_H/Q6_K_H results. Fix model size c308807 steampunque commited on Nov 11, 2025
Add Qwen2.5 Omni 3B vision and audio results. Add Qwen3 VL 8B Thinking vision result. 5ad9c28 steampunque commited on Nov 10, 2025
Update Qwen 2.5 Omni 7B vision result with mtmd fixes for Qwen. 1ffcb48 steampunque commited on Nov 5, 2025
Add REALWORLDQA vision test. Update Qwen 2.5 VL 7B result with mtmd fixes for Qwen. f24e884 steampunque commited on Nov 2, 2025
Add Qwen3 thinking 2507 results, clarify think model test methodology in description 9fee3e0 steampunque commited on Oct 15, 2025
Add Qwen3 Coder 30B A3B instruct code results. Remove obsolete code models. a6f6477 steampunque commited on Aug 4, 2025
Add Llama scout vision, QwQ hybrid quants. Remove more obsolete models. 00ada6c steampunque commited on Aug 3, 2025
Remove invalid Llama 3.2 1B ultravox result since llama3.2 1B cannot reliably self grade. 66f8a4a steampunque commited on Jul 30, 2025
Update self grade prompt so self grading for Llama 3.1 8b works, regen result. 75f318c steampunque commited on Jul 29, 2025
update all audio results using CoT and add some new ultravox models 65c038e steampunque commited on Jul 14, 2025