gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. β’ 2 items β’ Updated Aug 7 β’ 389
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation Paper β’ 2503.21729 β’ Published Mar 27 β’ 29
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12 β’ 473
emπing series Collection crispy sentence embedding family β’ 5 items β’ Updated Oct 14, 2024 β’ 27
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit β’ 28 items β’ Updated 5 days ago β’ 91
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π Jul 5, 2024 β’ 303
view article Article Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer) +1 Jun 16, 2023 β’ 43
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models Paper β’ 2407.01920 β’ Published Jul 2, 2024 β’ 17
OLMo Suite Collection Artifacts for the first set of OLMo models. β’ 18 items β’ Updated 8 days ago β’ 74
Model-Based Control with Sparse Neural Dynamics Paper β’ 2312.12791 β’ Published Dec 20, 2023 β’ 6
Building Cooperative Embodied Agents Modularly with Large Language Models Paper β’ 2307.02485 β’ Published Jul 5, 2023 β’ 11