Self-Taught Self-Correction for Small Language Models Paper • 2503.08681 • Published Mar 11, 2025 • 15
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 2.3M • • 1.42k
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B Text Generation • 8B • Updated May 29, 2025 • 466k • • 1.01k
google/gemma-3-27b-it-qat-q4_0-gguf Image-Text-to-Text • 27B • Updated Apr 11, 2025 • 11k • 372