AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models Paper β’ 2511.14295 β’ Published 21 days ago β’ 71
Huxley-GΓΆdel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine Paper β’ 2510.21614 β’ Published Oct 24 β’ 22
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale Paper β’ 2509.14008 β’ Published Sep 17 β’ 88
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL +4 Jun 3 β’ 96
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 β’ 40 items β’ Updated Jul 21 β’ 348
Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language Models Paper β’ 2503.08275 β’ Published Mar 11 β’ 4
Running 3.55k The Ultra-Scale Playbook π 3.55k The ultimate guide to training LLM on large GPU Clusters
unsloth/Llama-3.3-70B-Instruct-bnb-4bit Text Generation β’ 71B β’ Updated 15 days ago β’ 20.8k β’ 51
ibnzterrell/Nvidia-Llama-3.1-Nemotron-70B-Instruct-HF-AWQ-INT4 Text Generation β’ 11B β’ Updated Dec 7, 2024 β’ 147 β’ 6
hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4 Text Generation β’ 11B β’ Updated Aug 7, 2024 β’ 96.4k β’ 107